DATA EXPLORATION, CLEANING, AND INTEGRATION FOR DATA SCIENCE
Big Data is often said to deal with four Vs: volume, velocity, variety, and veracity. The focus is on variety and veracity challenges, which often arise in data science projects. In many such projects, data is often incorrect, hard to understand, and come from a variety of sources. Data scientists often spend 80% of their effort to explore, clean, and integrate this data, before analysis can be carried out to extract insights. As a result, managing variety and veracity has received significant attention. Study these topics, understand their challenges, and discuss solutions. These solutions often require data management, machine learning, big data scaling, cloud, crowdsourcing, and user interaction techniques. Knowledge of machine learning/AI [COMP SCI 540], databases [COMP SCI 564] and Python [COMP SCI 320] recommended.
Not Reported
Not Reported
No change from Historical
No change from Historical
No change from Historical
No change from Historical
Cumulative Grade Distribution
Sorted by ratings from Rate My Professors
Similar Courses
Sorted by ratings from Rate My Professors
No instructors found.
Visual representation of course prerequisites and related courses.
Note: We aren't showing all possible requisite relationships, only those that are directly relevant to the course.
Loading Graph...
No schedule information available for this course.