Unstructured Data Analytics for Policy

94-775

Units: 6

Description: Many organizations need to analyze large amounts of data such as text, images, audio, and video to discover useful information. For example, a company may want to monitor how the public discusses its products on social media, or a forensics team may need to discover the contents of disk drives seized by law enforcement. A recurring issue is that we often do not know initially what structure is present in the data. This course provides students with an understanding of common and emerging methods of organizing, summarizing, and analyzing large collections of this unstructured data (“unstructured data analytics”). The course involves some programming, although the emphasis is on concepts. Students interested in more hands-on coding experience should consider taking 95-865 instead. Prerequisites: None

Learning Outcomes: By the end of the course, students are expected to have developed the following skills. Skills are assessed by the homework assignments and the final exam.
* Recall and discuss common methods of conducting exploratory and predictive analysis of unstructured data;
* Write Python code for exploratory and predictive data analysis; and
* Apply unstructured data analysis techniques discussed in class to solve problems faced by governments and companies.

Prerequisites: (90-819 Intermediate Programming with Python or 95-888 Data Focused Python) AND 95-791 Data Mining.

Syllabus: 94-775_Unstructured_Data_Analytics_for_Policy_Syllabus_S19.pdf