units
FIT5196
Faculty of Information Technology
This unit entry is for students who completed this unit in 2015 only. For students planning to study the unit, please refer to the unit indexes in the the current edition of the Handbook. If you have any queries contact the managing faculty for your course or area of study.
Refer to the specific census and withdrawal dates for the semester(s) in which this unit is offered.
Level | Postgraduate |
Faculty | Faculty of Information Technology |
Offered | Not offered in 2015 |
Notes
Monash Online offerings are only available to students enrolled in the Graduate Diploma in Data ScienceGraduate Diploma in Data Science (http://online.monash.edu/course/graduate-diploma-data-science/?Access_Code=MON-GDDS-SEO2&utm_source=seo2&utm_medium=referral&utm_campaign=MON-GDDS-SEO2) via Monash Online.
This unit introduces tools and techniques for data wrangling. It will cover different data types and formats, data cleansing and pre-processing for analytics, for instance the handling of bad data, data structures used for efficient analytics, and efficient data storage. It will also cover SQL and NOSQL in-memory distribution, text mining, web analytics. Basic NLP Python will be used for implementation and case studies will be drawn from industry.
On successful completion of this unit, it is expected that student should be able to:
In-semester assessment: 100%
Minimum total expected workload equals 144 hours per semester comprising:
(a.) Contact hours for on-campus students:
(b.) Contact hours for Monash Online students:
(c.) Additional requirements (all students):
See also Unit timetable information