We are live
Category Archives: Events
Reflections of Chennai’s Data Workshop from India Water Portal
Cross Posted from India Water Portal
Written by Aarti Kelkar-Khambete
This workshop, organised by Transparent Chennai at The Institute of Financial Management and Research, Chennai, grew out of the earlier open data camps organised by Transparent Chennai in Bangalore and Hyderabad. At those events there was wide discussion among attendees who were excited by the potential of data and the open data movement, but who did not have the skills or technical background to work with it effectively.
It was felt that there was a much larger community of activists, researchers and non-profits who could benefit from learning to use the kinds of tools presented at the camps. This event was therefore planned differently from a data camp: it focused on training activists, researchers and students to work with data. Participants would learn about open data, data visualisation, spatial data and the practical issues that come up when working with data in various forms.
The workshop thus aimed at helping the participants to:
- Understand various formats of data, diverse possibilities of data visualisation and effective tools for doing so, with a special focus on web-based tools
- Understand how to think through projects involving collection, processing and visualisation of data
- Develop a basic understanding of software packages and methods for visualising quantitative data, creating geo-visualisation and undertaking participatory mapping
- Understand the connection between data technologies and rights to access and use data.
Read the rest of the summary here.
Data Science meets Data Technology
While data is increasingly important in academia as well as in industry, the two worlds do not intersect all that often. DSDT is a monthly forum for sharing ideas about data across disciplines and industries. Each DSDT meeting will consist of two talks on a common theme, pairing a data scientist with a data technologist, along with time for discussion. From the second session onward, we will have a tutorial and hacking session after the talks, where we will learn how to understand and analyse data sets relevant to that meeting’s theme. The schedule for the first meeting on the 18th at NIAS is given below.
Scrapathon 1: Rajasthan Rain Water Data
Cross Posted from Rajasthan Rainfall Data (1957 to 2011) by Thejesh GN
The Rajasthan rainfall data was scraped as part of the Scrapathon held in Bangalore on 21st July 2011. Initially I used ScraperWiki, but the huge amount of data made it time out all the time 🙂 so I wrote a simple Python script to do it.
Data is in the SQLITE file data.sqlite, in a table called rainfall. It has 6,61,459 rows.
Columns: DISTRICT, STATION, YEAR, MONTH, DAY, RAIN_FALL, PAGE_ID
PAGE_ID refers to the ID in the table webpages, which lists the webpages from which the data were scraped. It will help you in case you want to cross-check. The rest of the columns are self-explanatory. I have signed the SQLITE database using my GPG keys, and the signatures are inside the file data_gpg_signature.sig
You can download my public key from any keyserver or from biglumber.
You can download it here for now. I will try to make it available on torrent later.
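As a minimal sketch of how the table described above could be queried, the following uses Python's built-in sqlite3 module. The table and column names come from the description in this post; the column types (numeric YEAR, MONTH and RAIN_FALL) and the helper names `monthly_totals` and `source_page_ids` are assumptions made for illustration, not part of the published data set.

```python
import sqlite3


def monthly_totals(conn, district, station, year):
    """Return {month: summed RAIN_FALL} for one station in one year.

    Assumes YEAR and MONTH are stored as numbers and RAIN_FALL is numeric;
    adjust the query if the actual file stores them differently.
    """
    rows = conn.execute(
        "SELECT MONTH, SUM(RAIN_FALL) FROM rainfall "
        "WHERE DISTRICT = ? AND STATION = ? AND YEAR = ? "
        "GROUP BY MONTH ORDER BY MONTH",
        (district, station, year),
    ).fetchall()
    return {month: total for month, total in rows}


def source_page_ids(conn, district, station, year):
    """Return the distinct PAGE_ID values behind those rows.

    Each PAGE_ID points at a row in the webpages table, so this is the
    list to look up when cross-checking against the scraped pages.
    """
    rows = conn.execute(
        "SELECT DISTINCT PAGE_ID FROM rainfall "
        "WHERE DISTRICT = ? AND STATION = ? AND YEAR = ? "
        "ORDER BY PAGE_ID",
        (district, station, year),
    ).fetchall()
    return [page_id for (page_id,) in rows]
```

With the downloaded file, opening a connection with `sqlite3.connect("data.sqlite")` and calling `monthly_totals(conn, district, station, 2010)` should give one total per month, provided the district and station names are spelled exactly as they appear in the table.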
PIN code mapping
Where: Skype ID: datameet
Agenda:
- Introductions [everyone, 10 seconds each]
- Guest speaker: Pete Warden
- How do we get more PIN codes mapped at http://pincode.datameet.org/
- Understanding OpenHeatMap
- Recording of the talk is available
Summary:
- We’ll go for bulk geo-coding as opposed to crowd-sourcing
- We’ll bulk source addresses. Please add any other sources you can think of:
  - The Postal College’s list of post offices
  - Branch lists from banks such as SBI, or organisations like BSNL
  - Telephone directories
- We’ll run them through Yahoo’s Placefinder, which is liberal in API limits and in licensing
- We’ll create Voronoi treemaps out of those (ideally as OpenStreetMap XML files)
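The idea behind the last step can be sketched without drawing any polygons. A Voronoi cell is the set of locations closer to one site than to any other, so once post offices are geocoded, looking up the nearest geocoded point gives the same partition. The block below is only an illustration under that assumption: the function names are hypothetical, the site labels and coordinates are placeholders rather than real geocoded data, and it does not call any geocoding service or produce OpenStreetMap XML.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def nearest_pin(lat, lon, sites):
    """Return the label of the site whose Voronoi cell contains (lat, lon).

    sites is a list of (label, lat, lon) tuples. The nearest site by
    great-circle distance is, by definition, the owner of the cell.
    """
    return min(sites, key=lambda s: haversine_km(lat, lon, s[1], s[2]))[0]


# Placeholder sites: labels and coordinates are made up for illustration.
SAMPLE_SITES = [
    ("PIN-A", 12.0, 77.0),
    ("PIN-B", 13.0, 80.0),
    ("PIN-C", 17.0, 78.0),
]
```

A linear scan like this is fine for a few thousand sites; for the full list of post offices a spatial index would be the more practical choice.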
Links mentioned during the meet:
- http://developer.yahoo.com/geo/placefinder/ geocodes addresses liberally
- http://en.wikipedia.org/wiki/Voronoi_diagram explains Voronoi diagrams, the idea behind the Voronoi treemaps mentioned above
- http://www.ptcmysore.gov.in/Publication.aspx has the official list of PIN codes. This will be available to us soon, but it does not have the data geo-coded
- http://mapstore.mapmyindia.com/pincode.html has geocoded data, but not for open use
Text & Geo processing
Where: Skype ID: datameet
Agenda:
- Introductions [everyone, 10 seconds each]
- Discussion on the most interesting visualisation you’ve seen recently
- Discussion on any sources of data you’ve come across
- Recording of the talk is available
Links mentioned during the meet:
- Anand, on text analysis:
  - http://neoformix.com/Projects/TwitterSpectrum/TwitterSpectrum.html
  - http://sip.s-anand.net
  - http://sip.s-anand.net/?url=http://en.wikipedia.org/wiki/Disaster
  - http://www.s-anand.net/blog/visualisation-locating-hubs/
- Ananth, an interesting visualisation:
  - http://www.khanacademy.org/video/khan-academy-exercise-software?playlist=Khan%20Academy-Related%20Talks%20and%20Interviews
- Ananth, on geographic visualisations:
  - http://www.engadget.com/2011/02/24/visualized-android-activations-mapped-geographically-chronolog/
  - http://paulbutler.org/
- Ravi, on sources of PIN code information:
  - http://www.indiapost.gov.in/rtimanual14.html
Quote of the call: “this call started with no agenda, but ended with quite hands full. happy. nothing more to add” — Balaganesh
R, Processing, Protovis
Where: Skype ID: datameet
Agenda:
- Introductions [everyone, 10 seconds each]
- Joining the mailing list, and sharing articles related to data science via RSS [Thej, 3 min]
- Splitting up sections on the “Wiki” between ourselves to populate content [Manu, 3 min]
- Adding to the directory and data store [Anand, 2 min]
- Talks [The links have audio]:
  - Learning R [Anand]
  - Processing & Protovis [Arun & Venkat]
- Indian budget visualisation [All, 5 min]
- Taking the physical community forward [Bala, 2 min]
Links mentioned during the meet:
- Arun Ganesh: http://processing.org/ (example at http://blog.blprnt.com/blog/blprnt/just-landed-processing-twitter-metacarta-hidden-data)
- Venkatraman: http://vis.stanford.edu/protovis/ (example at http://www.janwillemtulp.com/nhqr/)
- S Anand: http://processingjs.org/
- Venkatraman: A simple viz challenge; check out the submissions at http://flowingdata.com/2011/01/13/visualize-this-where-the-public-gets-its-news/
- Venkatraman: http://stats.stackexchange.com/
- Dwarakesh Venkatesan: http://www.4shared.com/get/mjccKxj5/Statistics_4_Utterly_Confused.html;jsessionid=08113111E9F6070A83B721D4DEAD59F2.dc283