FIT5196
Faculty of Information Technology
This unit entry is for students who completed this unit in 2016 only. For students planning to study the unit, please refer to the unit indexes in the current edition of the Handbook. If you have any queries, contact the managing faculty for your course or area of study.
Refer to the specific census and withdrawal dates for the semester(s) in which this unit is offered.
Faculty
Offered
Monash Online
Notes
Monash Online offerings are only available to students enrolled in the Graduate Diploma in Data Science (http://online.monash.edu/course/graduate-diploma-data-science/?Access_Code=MON-GDDS-SEO2&utm_source=seo2&utm_medium=referral&utm_campaign=MON-GDDS-SEO2) via Monash Online.
This unit introduces tools and techniques for data wrangling. It will cover the problems that prevent raw data from being effectively used in analysis, and the data cleansing and pre-processing tasks that prepare it for analytics. Examples include handling bad and missing data, data integration, and initial feature selection. It will also introduce text mining and web analytics. Python and the Pandas environment will be used for implementation.
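As an illustration only (not taken from the unit materials), the kinds of tasks named above might look like the following minimal Pandas sketch. The column names, the use of -1 as a bad-value sentinel, the median imputation and the second "regions" table are all assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical raw records: -1 is assumed to mark a bad reading,
# and one age is missing entirely.
raw = pd.DataFrame({
    "id": [1, 2, 3, 4],
    "age": [34, -1, np.nan, 51],
})

# Hypothetical second source to integrate on the shared "id" key.
regions = pd.DataFrame({
    "id": [1, 2, 3],
    "region": ["north", "south", "east"],
})

# Bad data: treat the assumed sentinel value as missing.
clean = raw.copy()
clean["age"] = clean["age"].replace(-1, np.nan)

# Missing data: impute with the median of the observed values
# (one of several possible strategies).
median_age = clean["age"].median()
clean["age"] = clean["age"].fillna(median_age)

# Data integration: a left join keeps every record from `clean`,
# leaving `region` missing where the second source has no match.
merged = clean.merge(regions, on="id", how="left")
print(merged)
```

A left join is used here so that no cleaned record is silently dropped; whether unmatched rows should be kept or discarded depends on the analysis.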
At the completion of this unit, students should be able to:
In-semester assessment: 100%
Minimum total expected workload equals 144 hours per semester comprising:
(a.) Contact hours for on-campus students:
(b.) Contact hours for Monash Online students:
(c.) Additional requirements (all students):
See also Unit timetable information