DATA EXPLORATION, CLEANING, AND INTEGRATION FOR DATA SCIENCE
Big Data is often said to deal with four Vs: volume, velocity, variety, and veracity. The focus is on variety and veracity challenges, which often arise in data science projects. In many such projects, data is often incorrect, hard to understand, and come from a variety of sources. Data scientists often spend 80% of their effort to explore, clean, and integrate this data, before analysis can be carried out to extract insights. As a result, managing variety and veracity has received significant attention. Study these topics, understand their challenges, and discuss solutions. These solutions often require data management, machine learning, big data scaling, cloud, crowdsourcing, and user interaction techniques. Knowledge of machine learning/AI [COMP SCI 540], databases [COMP SCI 564] and Python [COMP SCI 320] recommended.
Not Reported
Not Reported
Could not calculate change
Could not calculate change
Could not calculate change
No change from Historical
No data available
Sorted by ratings from Rate My Professors
Similar Courses
No data available
Sorted by ratings from Rate My Professors
No instructors found.
Visual representation of course prerequisites and related courses
Loading Graph...