DataMeet 6 was a 2 day, Data Science Hackathon that was organised by a BFSI company, Zone Startups and DataMeet Mumbai. The Hackathon took place in the Bombay Stock Exchange Building at Zone Startup’s office. Twelve teams participated. These included teams of young data enthusiasts and specialist data scientists teams from companies like TCS and Housing.com.
The BFSI company opened up 80GB of it’s real transactional data in a secure environment to the participating data enthusiasts.
The teams were expected to analyze the data and draw out insights that would be relevant to their use case scenarios such as Health Bankruptcy or pull out a trend which is hidden and unknown to the BFSI company. Teams were free to use any tool of their choice from R, Python, Tableau, etc.
Each team was provided an individual secure Oracle DB connection from which they could query the data but not download the data. The Oracle DB connections were opened only to the Static IPs of Zone Startups Office and the data to and fro from the servers was monitored to ensure against downloading of the data.
The day started with various teams analysing the raw data, tables, meaning of columns. The representatives from the BFSI company also gave a briefing about objectives.
Many of the young teams did not turn up on Day 2 due to complexity of the problem. At the end of Day 2, the judges from the BFSI company evaluated each team’s progress, gave feedback and suggestions.
Mumbai had it’s fourth data meet on December 6, 2014 with a total of 11 participants. Due to scheduling issues, the November meet-up was moved from last Saturday of the month to the first Saturday of December. This time the meet-up was held at Pykih’s office on 8th floor at Sardar Patel Institute Of Technology.
The speaker was Bhavin Dalal, Senior Technology Manager, from Hansa Cequity.
At Cequity, he plays multiple role not limiting to solution architect, consultant & project manager. While he has strong product framework knowledge , his expertise lies in data warehousing technologies.
Bhavin spoke on two main topics:
1. Data Cleaning – he explained what is Data Quality and which factors determine the quality of data. He briefed through the common Data quality problems faced while cleaning the data. He showed us an example where they faced problems while cleaning car data and how they solved it. He also explained data cleaning methods which will helped us to understand the approaches towards data cleaning, the importance to do data cleaning and some do’s and don’t while capturing data.
2. Visualising census data for better understanding India – here he gave us eye popping fact list revolving around the census data. This topic gave us the better understanding that there are plethora of data points which can be meaningfully used to come up with really good insights on Indian population.
The next data meet will be held on last Saturday of December 2014. Please follow the Mumbai Meet-Up Group to know about the details.