Apr 27, 2024  
2016-2018 Graduate Catalog [ARCHIVED CATALOG]

BIA 6305 - Preparation and Analysis for Big Data


(2)

This course emphasizes the extraction, transformation, and preparation of data for analytical purposes, drawing on traditional relational databases as well as more complex storage systems (such as Hadoop). Students will be introduced to data wrangling, munging, and scraping of both structured and unstructured data. Students will also be introduced to parallel processing for big data, such as MapReduce, and to query languages like Hive. Exposure to any programming language is required. The primary software tool for this class will be Python; students will also need access to a standard relational database (Oracle or MySQL) and a Hadoop system.

Prerequisite: BIA 6301, BIA 6311, BIA 6312 or consent of the program director.