Category Archives: News

Demonetisation with Srinivasan Ramani

January 22, 2017 Thejesh GN

Srinivasan Ramani is Deputy National Editor who works with data at The Hindu. He has been a long time member of DataMeet community. This week I caught up with him to talk about Demonetisation move by Government of India.

Show Notes

Government Press Release -PDF
PM Modi Announces Notes Ban In Anti-Corruption Move, Millions Face Cash Crunch
Demonetisation: 1978, the Present and the Aftermath
livemint says In 1946, Rs1,000, Rs10,000 banknotes were withdrawn. In 1954, Rs1,000, Rs5,000, Rs10,000 notes were reintroduced but again demonetized in January 1978
As Much As 97% Of Banned Notes Are Back In Banks: Report
RBI Data Release portal
‘Rs. 400 cr. in fake notes to be flushed out’
Demonetisation: Rs 2.5 lakh crore won’t come back into banking system, says SBI
Rs 12.44 lakh crore demonetized notes back in the system: RBI
Rs 11.5 lakh crore deposits! Did RBI double-count both new and old notes?
Double counting of deposits ruled out
Cash crunch: Analysts cut India GDP growth forecast
Farmers forced to dump their produce as note ban turns bumper crop worthless
Demonetisation fallout: ADB reduces India GDP growth forecast to 7%
Demonetisation is for the long run

Crossposted.

News

OpenPostbox.org

February 13, 2016 Thejesh GN

We got a chance to talk to members of Karnataka Philatelic Society about OpenPostBox. They are very interesting set of people. They have also started sending me the postbox pictures using WhatsApp along with location. Now I need to find an efficient way to extract them and insert into my database.

As of now I am thinking of Export -> Parse -> Insert. Working on it. If you have any ideas do email me.

Details of the meet are on my personal blog if you like to read.

Data, News, Reports

Five Years of DataMeet Discussions

February 7, 2016 Thejesh GN 1 Comment

We consider 26/01/2011 as DataMeet birthday. Thats the day we talked about starting DataMeet and hence it is the birthday. But the first email to the group was sent by S.Anand on 27/01/2011. Its been five years since that first email. I took this opportunity to scrape the email list to see how we are doing and what we talked about in last five years.

Growth

Activity

Members have started 1525 and have sent in total 4570 emails. But most important is how many participate.

Category	Members
No Emails	855
1 Emails	184
2 Emails	75
3 Emails	43
More than 3	189

Discussions

Go have a look at full view of the traffic graph. Except for few peaks the group has been fairly consistent.

Starters

We have discussed about 1525 in last five years. Here is the list of top 20 starters.

author	total topics started
Nisha Thompson	199
Thejesh GN	164
sumandro	71
Sridhar Gutam	64
srinivas kodali	36
Gautam John	30
Sajjad Anwar	28
Pranesh Prakash	27
bawaza…@gmail.com	27
Venkatraman.S.	23
satyaakam	22
S Anand	21
Balaji Subbaraman	20
Nikhil VJ	19
Justin Meyers	15
Sanky	15
Dilip Damle	14
Maya Indira Ganesh	13
Shree	13

First Responders

The first responders are important when someone posts a question. They are the first ones to respond to the questions. As you would have guessed the list is different from the starters list.

author	number first response
Devdatta Tengshe	36
Gautam John	36
Nisha Thompson	57
srinivas kodali	28
Thejesh GN	27
Sajjad Anwar	21
satyaakam	20
Arun Ganesh	16
Avinash Celestine	15
Venkatraman.S.	15
Anand Chitipothu	14
sumandro	13
Dilip Damle	10
JohnsonC	10
S Anand	10
Gora Mohanty	9
Meera K	9
Sabarish Karunakar	9
Nikhil VJ	8

Part of many discussions

These are the members who have participated the most.

author	total_emails_sent
Nisha Thompson	397
Thejesh GN	297
Gautam John	158
srinivas kodali	128
sumandro	109
Sajjad Anwar	93
Arun Ganesh	88
Dilip Damle	88
Devdatta Tengshe	85
satyaakam	83
Sridhar Gutam	81
Avinash Celestine	73
Justin Meyers	71
S Anand	68
Pranesh Prakash	67
Venkatraman.S.	64
Nikhil VJ	55
Raphael Susewind	55
Anand Chitipothu	51

Topics

We have discussed many many topics over years. But there are some popular topics. I have the list of topics by most replies.

Starter	date/time	topic
Karthik Shashidhar	2015-05-04 23:00:01	Shapefiles for "complete" India
megha	2014-04-10 14:10:21	MP/MLA Shapes
Srihari Srinivasan	2013-03-06 22:59:44	List of BMTC Bus stops
Nisha Thompson	2014-05-20 23:51:49	Logo Contest Voting!
S Anand	2016-02-01 18:31:38	PIN code geocoding
Siddarth Raman	2014-04-17 16:16:29	Parliamentary Constituency to Assembly Constituency to Ward linkages
Nisha	2013-04-15 09:44:21	April's Bangalore DataMeet
Gautam John	2012-04-14 09:49:50	I Change My City
Arun Ganesh	2011-03-14 11:23:25	Licensing crowdourced data projects
Sharad Lele	2015-11-27 19:59:49	Census of India seems to have maps of everything!

We also get quite a bit of traffic through search engines. So here is the list of top topics by views.

username	date_time	views	topic
Karthik Shashidhar	2015-05-04 23:00:01	12324	Shapefiles for "complete" India
S Anand	2016-02-01 18:31:38	4783	PIN code geocoding
srinivas kodali	2013-07-01 12:49:33	2291	GeoJson data of Indian states
Aashish Gupta	2014-02-24 10:23:12	763	1981 and 1991 district-wise census data
Justin Meyers	2014-07-26 22:05:13	668	Updated Taluk Shapefile!!
indro ray	2013-08-13 10:21:18	651	MCD Delhi Admin Boundary GIS map
My profile photo	2012-08-30 17:41:45	615	Bangalore – BBMP ward boundaries – shape files available now
megha	2014-04-10 14:10:21	556	MP/MLA Shapes
Kavita Arora	2012-09-13 23:32:25	546	Ward Wise data for Bangalore – 2011 census?
Renaud Misslin	2014-12-03 09:45:16	426	Delhi ward shapefile for census 2011 data

At last customary wordcloud of topics.

Of course all the scrapers and data is available on github. Go ahead make your own visualizations.

News

Nobel prize Winner Angus Deaton on the importance Open Data in India

Quote October 19, 2015 Thejesh GN

On Data{Meet} we have been talking about the importance of Open Data and quality of it. This year’s winner of the Nobel Prize for Economics Angus Deaton has similar point of view on the quality of open data. Whole article is worth reading, I am quoting a paragraph.

My work shows how important it is that independent researchers should have access to data, so that government statistics can be checked, and so that the democratic debate within India can be informed by the different interpretations of different scholars. High quality, open, transparent, and uncensored data are needed to support democracy.

I have used data from India’s famous National Sample Surveys to measure poverty. Perhaps the biggest threat to these measures is that there is an enormous discrepancy between the National Accounts Statistics and the surveys. The surveys “find” less consumption than do the national accounts, whose measures also grow more rapidly. While I am sure that part of the problem lies with the surveys—as more people spend more on a wider variety of things, the total is harder to capture—but there are weaknesses on the NAS side too, and I have been distressed over the years that critics of the surveys have got a lot more attention than critics of the growth measures. Perhaps no one wants to risk a change that will diminish India’s spectacular (at least as measured) rate of growth?

Source: TheWire
Picture credit: Nobel Prize

Delhi, News, OpenDataCamp

OpenDataCamp Delhi 2014 in Tweets

November 18, 2014 Thejesh GN 1 Comment

Last minute printing #odcdel14! http://t.co/KCj3lAbKkT @datameet http://t.co/P2JhEqWBZn

— Sajjad Anwar (@geohacker) November 14, 2014

https://twitter.com/ajantriks/status/533225676774449152

Tomorrow @ #odcdel14 @ Adianta School for Leadership and Innovation http://t.co/mBdBriOl2c

— Thejesh GN ⏚ ತೇಜೇಶ್ ಜಿ ಎನ್ | @thej@social.thej.in (@thej) November 14, 2014

Tomorrow @ #odcdel14 @ Adianta School for Leadership and Innovation http://t.co/189c6k2TrO

— Thejesh GN ⏚ ತೇಜೇಶ್ ಜಿ ಎನ್ | @thej@social.thej.in (@thej) November 14, 2014

So @datameet is organising #ODCDel14 tomorrow. Feat. Yamini from @AccInitiative, @rukmini_shrini and more: http://t.co/P7cjsbtBlQ

— Gautam John | https://lou.lt/@gkjohn (@gkjohn) November 14, 2014

@datameet there is an interesting conference going on urban issues in delhi, they got cool viz http://t.co/6vpDe3HR2w #UADelhi #ODCDel14

— Srinivas Kodali (@digitaldutta) November 14, 2014

Pre #odcdel14 catchup at Monkey Bar, Vasant Kunj if anyone's up. @fakenisha, @thej and I are here. @datameet.

— Sajjad Anwar (@geohacker) November 14, 2014

Jetlagged but early. Rise and shine #odcdel14! http://t.co/npQR68FSd6 http://t.co/eLEAyESvR0

— Sajjad Anwar (@geohacker) November 15, 2014

@datameet #Delhi #odcdel14 registration starts! its a day for #opendata @akvo pic.twitter.com/bO6Ha7ygJ4

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

Attending #ODCDel14 today? Tweet using the hashtag and I'll add you to our list of participants: https://t.co/kt8dxPXlgY

— Nasr ul Hadi (@nasrhadi) November 15, 2014

https://twitter.com/Sramach9/status/533465685624913920

Our director Yamini Aiyar is the keynote speaker at #ODCDel14 today! We'll be tweeting her talk and others through the day @datameet

— Accountability Initiative (@AccInitiative) November 15, 2014

At @DataMeet's Open Data Camp today? Tweet w/ #ODCDel14 to join this list of participants: https://t.co/7BCagVybwm pic.twitter.com/JVeEo5Ueeg

— Nasr ul Hadi (@nasrhadi) November 15, 2014

Time to talk Data at Open Data Camp Delhi! #ODCDEL14 pic.twitter.com/HbQZpZjbsP

— Eric (@snootdude) November 15, 2014

The #opendata #delhi meet is about to start off and we are getting a packed house!! #odcdel14 @akvo pic.twitter.com/ws15hr09i1

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

Screw jet lag, ready for Open Data Camp Delhi w00t! #ODCDel14 pic.twitter.com/kEHqfdc9Td

— Alex Barth (@lxbarth) November 15, 2014

https://twitter.com/Shobha_SV/status/533473456147664896

At the #odcdel14. This place is running out of chairs! m/

— Souvik Das Gupta (@souvikdg) November 15, 2014

#odcdel14 is on! http://t.co/6x3ywCIQxQ

— Sajjad Anwar (@geohacker) November 15, 2014

Yamini Aiyar of @AccInitiative speaks at #ODCDel14 pic.twitter.com/Erwiu7JTua

— Eric (@snootdude) November 15, 2014

Yamini Aiyer from #accountability initiative giving her inaugural talk at #ODCDEL14 @akvo @datameet pic.twitter.com/GxSV8CxTVC

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

New govt. sends signals that it wants to increase #transparency by using technology to put #opendata in public domain – Yamini at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

IVRS used for #MidDayMeals in #UP to collect citizen feedback: accessible technology to generate #opendata – Yamini at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

https://twitter.com/Sramach9/status/533475979570585600

#PublicPolicy in India moving towards evidence-based policy making: #opendata part of this larger picture – Yamini at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

"Data is deeply political" Yamini Aiyer at #ODCDel14 @datameet @akvo

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

Yamini Aiyar: "data is a vehicle to facilitate conflict and restructure power relations" #ODCDEL14 @AccInitiative @EPoDHarvard

— Eric (@snootdude) November 15, 2014

Govt's using to tech to produce #opendata. @CMOfficeUP tracked #MidDayMeals consumption w/ IVRS. @AccInitiative's Yamini Aiyar at #ODCDel14.

— Nasr ul Hadi (@nasrhadi) November 15, 2014

Information, #opendata empower citizens to demand answers and invert the usual dynamic between citizens and #government – Yamini #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

#opendata is deeply political: changes power relations between citizens and the state and also forces state restructuring: Yamini #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

If you have to keep using technology 2 track Government officials to do their work, then we maybe missing the woods for the trees #ODCDel14

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

Yamini Aiyar: "need to go beyond technomanagerial perspective to think about design of institutions" #ODCDel14 @AccInitiative @EPoDHarvard

— Eric (@snootdude) November 15, 2014

Thinking about larger political questions behind #opendata and evidence critical to add value in a democratic setup – Yamini at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

There is something wrong with the system if your knee jerk reaction is to use data technologies to monitor work @yaminiaeyar #ODCDel14

— Namrata Mehta (@littlenemrut) November 15, 2014

https://twitter.com/Shobha_SV/status/533478678382919682

We track #education #budget data not just for efficiency but also to empower parents to know more about their children: Yamini at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

https://twitter.com/Shobha_SV/status/533479268425023488

Reformist bureaucrats prefer 2 use technology 2 fix things, because that is harder to resist internally. #YaminiAiyer at #ODCDel14

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

https://twitter.com/Shobha_SV/status/533479741232119808

#technology lever to push change: govt systems cannot resist deployment of tech! Makes bureaucrats think differently: Yamini #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

https://twitter.com/Shobha_SV/status/533480948361228290
https://twitter.com/Shobha_SV/status/533484852734349312

Full House! #ODCDEL14 pic.twitter.com/o0dRr4iTMc

— Nisha Thompson (@fakenisha) November 15, 2014

At Open Data Camp Delhi 2014 #odcdel14

— Vinay Rawat (@vinnipogo) November 15, 2014

lot of people working in public health data, water, and education! Great to see! #ODCDel14

— Nisha Thompson (@fakenisha) November 15, 2014

Type of data #ODCDel14 participants have been working with: transport, health, education, GPS, sensor-sourced, govt budgets, microfinance…

— Nasr ul Hadi (@nasrhadi) November 15, 2014

Fascinating range of data expertise at #ODCDel14; nice 'Ever have I ever' game organised by @ajantriks🙂

— ss (@s_shashank_s) November 15, 2014

At Open Data Camp Delhi #ODCDel14

— Sanya (@sanya_29) November 15, 2014

Everybody has worked with data someway or the other at #ODCDel14 .

— Anuj Prakash (@anujprksh) November 15, 2014

How are we using data? Crime, toilets, medicine, education, activism, airport showers, women, fashion, robotics. #diversityofuse #ODCDel14

— Aditi Surie (@sosurie) November 15, 2014

http://t.co/lkjYtsXfEf #awesomesauce #ODCDel14

— Namrata Mehta (@littlenemrut) November 15, 2014

https://twitter.com/Sramach9/status/533491064523743232
https://twitter.com/ZahirKoradia/status/533491133335105536

Here is mapping of postboxes that I have been mapping #odcdel14 http://t.co/izvEet0rYe

— Thejesh GN ⏚ ತೇಜೇಶ್ ಜಿ ಎನ್ | @thej@social.thej.in (@thej) November 15, 2014

Work with education data? Find me to talk about the work we do at @klpdotorg! http://t.co/rnTp3kkLmh #ODCDel14.

— Sajjad Anwar (@geohacker) November 15, 2014

i work on traffic, transport data, transit planning. Come meet me at #ODCDel14

— Srinivas Kodali (@digitaldutta) November 15, 2014

had an interesting conversation on maps with @lxbarth #ODCDel14

— Srinivas Kodali (@digitaldutta) November 15, 2014

https://twitter.com/Sramach9/status/533501991201153026

Data collection with akvo! #ODCDEL14 pic.twitter.com/Owe0yXPukP

— Nisha Thompson (@fakenisha) November 15, 2014

#ODCDel14's #collectingdata talks begin with @Akvo's @Amitangshu discussing what HASN'T worked. pic.twitter.com/QD8Qp2jXQp

— Nasr ul Hadi (@nasrhadi) November 15, 2014

People who collect data are the most under invested in the data collection process @akvo #odcdel14 @AdiantaDOTorg

— Aishwarya Panicker (@aishpanicker) November 15, 2014

"People responsible for data collection remain a low investment priority"problemitizing quality@amitangshu #ODCDel14 pic.twitter.com/jLmtawqhIA

— Aditi Surie (@sosurie) November 15, 2014

A yummy south indian thali to explain 'data tools: managing expectations' #odcdel14

— Guneet Narula (@guneetnarula) November 15, 2014

#ODCDel14 – great energy, fascinating data stories from all across India: elections, health, transport, environment, mapping + more

— Alex Barth (@lxbarth) November 15, 2014

#ODCDel14
"we all cry with data"@fakenisha

— ashutosh singh (@juggernaut451) November 15, 2014

India WASH Forum's Depinder Kapur taking about WASH data experience. #ODCDel14 pic.twitter.com/AgujHFCHUa

— Nisha Thompson (@fakenisha) November 15, 2014

So, cry me a database. 😉 MT @s_shashank_s I cry with data, says @fakenisha at @DataMeet's #ODCDel14.

— Nasr ul Hadi (@nasrhadi) November 15, 2014

https://twitter.com/Shobha_SV/status/533504601379442688

Inside the brain of a surveyor on the field: “God knows what the department will do with this data”— @amitangshu at #odcdel14. Ha!

— Souvik Das Gupta (@souvikdg) November 15, 2014

Best #data on #water quality in #India came from the mines and minerals dept. Depinder Kapur at #ODCDel14 @WatSanCollabCou @akvo

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

https://twitter.com/Shobha_SV/status/533505132206370817

Important to understand who is collecting data and for whom #indiawashforum #ODCDel14 pic.twitter.com/IR3iZN6MEL

— isha (@ishaparihar) November 15, 2014

Responsible Data! RT @ishaparihar: Important to understand who is collecting data and for whom #ODCDel14 pic.twitter.com/AEKaaml4R4

— Sajjad Anwar (@geohacker) November 15, 2014

"Public health issues need more than postfacto data correlations." #WASH #data4dev #Odcdel14 @WatSanCollabCou

— Aditi Surie (@sosurie) November 15, 2014

Usage and needs data in Indian santiation is desperately needed : We have no idea who needs how much water! Depinder Kapur at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

Next at @DataMeet's #ODCDel14, @IndiaWASHForum's @DepinderKapur on how to overcome barriers in #collectingdata. pic.twitter.com/yOyKztriKp

— Nasr ul Hadi (@nasrhadi) November 15, 2014

@nasrhadi nice meeting you there at #ODCDel14

— Vinay Rawat (@vinnipogo) November 15, 2014

https://twitter.com/Shobha_SV/status/533506490640777218

Had to leave #ODCDel14 in between but the interesting talks are still ON.

— Vinay Rawat (@vinnipogo) November 15, 2014

#odcdel14 paper based forms still the most prevalent way to collect data, not just in India but world over says Zahir Karodia

— Namrata Mehta (@littlenemrut) November 15, 2014

@geohacker good to see you here at #ODCDel14

— Vinay Rawat (@vinnipogo) November 15, 2014

data collection through smart phone
-existing approaches
-challenges#ODCDel14

— ashutosh singh (@juggernaut451) November 15, 2014

mobile vaani
-voice based social media#ODCDel14

— ashutosh singh (@juggernaut451) November 15, 2014

Data priority, need assessment and beneficiary perception and social impacts-challenge of daTA in #WASH #ODCDel14

— isha (@ishaparihar) November 15, 2014

https://twitter.com/Shobha_SV/status/533507470870589441

zahir from gramvaani asks how to collect data without sending someone down to the field at #ODCDel14 @akvo @datameet pic.twitter.com/0vZ07OiyBa

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

@ZahirKoradia on @gramvaani challenges of data using voice calls only. #ODCDEL14 pic.twitter.com/YiD7kBwhUJ

— Nisha Thompson (@fakenisha) November 15, 2014

Open data? Pollution monitoring board on dead end lane in district secretariat, Muzaffarpur. #ODCDel14 @EPoDHarvard pic.twitter.com/kn5uw0d7mh

— Eric (@snootdude) November 15, 2014

"Paper-based forms still #collectingdata default worldwide. We do phone calls." @GramVaani's @ZahirKoradia #ODCDel14. pic.twitter.com/8tHX01d0WR

— Nasr ul Hadi (@nasrhadi) November 15, 2014

#odcdel14 such an important question to ask! What is the motivation to participate in a survey, or any research for that matter.

— Namrata Mehta (@littlenemrut) November 15, 2014

How do we automate speech recognition to collect data from IVRS responses? "We can't." says @GramVaani at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

speech recognition is still a challenge#ODCDel14

— ashutosh singh (@juggernaut451) November 15, 2014

https://twitter.com/Shobha_SV/status/533508246233829376

"What hasn't worked" data collection challenges @akvo #ODCDel14 pic.twitter.com/S7UPTzzCkj

— isha (@ishaparihar) November 15, 2014

How to collect data if no one is on the field? Zakir Koradia from @GramVaani explains how to use phones for it. #odcdel14

— Guneet Narula (@guneetnarula) November 15, 2014

Talks on collecting data at the #odcdel14 re-emphases on how ground realities differ from our imagination of a literate and high-tech world.

— Souvik Das Gupta (@souvikdg) November 15, 2014

https://twitter.com/Shobha_SV/status/533509041427722242

#odcdel14 omg, #gramvaani I want to hear everything you've collected!

— Namrata Mehta (@littlenemrut) November 15, 2014

https://twitter.com/Shobha_SV/status/533511798700654593

Representing data#ODCDel14

— ashutosh singh (@juggernaut451) November 15, 2014

And we move to #representingdata talks at #ODCDel14 w/ @EconomicTimes' @AC_Soc. Check out his DataStories.in pic.twitter.com/FvFk7pS5Up

— Nasr ul Hadi (@nasrhadi) November 15, 2014

Representing data @ac_soc sharing his experience. #ODCDEL14 pic.twitter.com/wM17kalxWB

— Nisha Thompson (@fakenisha) November 15, 2014

Attending @datameet #ODCDel14 Some awesome speakers, thinkers and doers. pic.twitter.com/0KAB8AxeIs

— BALA (@TweetingBala) November 15, 2014

@ac_soc starts his #ODCDel14 talk on representing data. @akvo @datameet pic.twitter.com/Nil8NOioMj

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

"The narrative around the data is important. Ask yourself what is the story I want to tell" @ac_soc at #ODCDel14 @akvo @datameet

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

Primary focus in #dataviz should be to build a story with your #data. Then focus on the best way to visualise it, says @ac_soc at #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

#ODCDel14
General design approach
always start with a Question
don't start with presentation

— ashutosh singh (@juggernaut451) November 15, 2014

Narratives are still paramount. Use data to supplement, compliment the story. @ac_soc #ODCDel14

— Aditi Surie (@sosurie) November 15, 2014

https://twitter.com/Sramach9/status/533513382054199296

Graphics Percival by the Brain #ODCDel14 pic.twitter.com/fwgeU1q1cG

— Arnold Subhashnagar (@markandeykaju) November 15, 2014

Understanding visual perception when representing data #ODCDel14 @ac_soc pic.twitter.com/xcYxrkW9Ew

— isha (@ishaparihar) November 15, 2014

A huge shift in the Indian #labour market is people moving away from #agriculture. @Ac_soc uses #dataviz to explore this shift: #ODCDel14

— Accountability Initiative (@AccInitiative) November 15, 2014

#ODCDel14 Update the flash talk slide deck if you are speaking this afternoon! http://t.co/74Vk2i9act

— Sajjad Anwar (@geohacker) November 15, 2014

datastories.in#ODCDel14

— ashutosh singh (@juggernaut451) November 15, 2014

@DataPortalIndia explains the genesis and philosophy behind http://t.co/jdZChSlVWw at #ODCDel14! #opendata @OpenDataIndia

— Accountability Initiative (@AccInitiative) November 15, 2014

Staying with #representingdata at #ODCDel14, @DataPortalIndia's Sunil Babbar discusses the govt's work on #opendata. pic.twitter.com/W4fKzq43Tw

— Nasr ul Hadi (@nasrhadi) November 15, 2014

Even @DataPortalIndia has problems getting data from govt agencies says Sunil Babbar, NIC. #ODCDel14

— Aditi Surie (@sosurie) November 15, 2014

@DataPortalIndia has 11,200+ data sets from over 82ministries and 5states of india. Collected in only 2yrs #ODCDel14 pic.twitter.com/VqALyJrt2q

— BALA (@TweetingBala) November 15, 2014

https://twitter.com/ZahirKoradia/status/533517016200904705

Design Strategies ! Principle of Design ! #ODCDel14 pic.twitter.com/J7qaJOHGam

— Arnold Subhashnagar (@markandeykaju) November 15, 2014

Better tracking of administrative region changes is needed to enable time series analysis! #ODCDel14 @EPoDHarvard @DataPortalIndia

— Eric (@snootdude) November 15, 2014

Next on #representingdata at #ODCDel14, @Folography's @AdityaDipankar. Information design #FTW. M pic.twitter.com/fR3hplDFVr

— Nasr ul Hadi (@nasrhadi) November 15, 2014

Catching a few principles of design at #ODCDel14 pic.twitter.com/hgydRGKN7E

— BALA (@TweetingBala) November 15, 2014

@adityadipankar on design! #ODCDEL14 pic.twitter.com/KSWC9yeIAm

— Nisha Thompson (@fakenisha) November 15, 2014

Charts and Chart Designs !#ODCDel14 pic.twitter.com/gKnYoRAJlb

— Arnold Subhashnagar (@markandeykaju) November 15, 2014

Aditya Dipankar: Get over the defensiveness associated with design critiques! #ODCDel14

— Eric (@snootdude) November 15, 2014

Is open data translating into useful decisions? How far are we from real-time data analysis? #ODCDel14 @EPoDHarvard @DataPortalIndia

— Eric (@snootdude) November 15, 2014

@rukmini_shrini presents her data journalism experience at @the_hindu at #ODCDel14 pic.twitter.com/DaVA4zhtk9

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

https://twitter.com/Sreechand/status/533522447799042049

Final leg of the planned pre-lunch sessions at #ODCDel14. @TheHindu's @Rukmini_Shrini on #usingdata in journalism. pic.twitter.com/M20CC3xFFd

— Nasr ul Hadi (@nasrhadi) November 15, 2014

@rukmini_shrini taking about using data! #ODCDEL14 pic.twitter.com/OIT9K9gXRJ

— Nisha Thompson (@fakenisha) November 15, 2014

This what I am trying to do, to solve traffic #ODCDel14 pic.twitter.com/CzoBpDIqbm

— Srinivas Kodali (@digitaldutta) November 15, 2014

Great lineup for flash talks this afternoon #odcdel14. Want to share something? Post it!… http://t.co/5T4udnjOqp

— Sajjad Anwar (@geohacker) November 15, 2014

https://twitter.com/Shobha_SV/status/533523564872220672
https://twitter.com/Shobha_SV/status/533524293879988225

data collected in India is based on sampling, so interrogating how samples r arrived at is an important question @rukmini_shrini #ODCDel14

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

View this post on Instagram

A post shared by Thejesh GN ತೇಜೇಶ್ ಜಿ ಎನ್ (@thejeshgn)

translating numbers to news @rukmini_shrini #ODCDel14 pic.twitter.com/HLZQYzpaBd

— isha (@ishaparihar) November 15, 2014

Challenge in working with data: Source contains a JPEG in an excel sheet. #facepalm #odcdel14

— Souvik Das Gupta (@souvikdg) November 15, 2014

@sonamitra @cbga on using data and budgets #ODCDEL14 pic.twitter.com/jEvAWIpHsp

— Nisha Thompson (@fakenisha) November 15, 2014

Rukmini Shrini talks data journalism and http://t.co/EBPUi0icc4 #ODCDel14 @TheHindu @rukmini_shrini pic.twitter.com/ylldRWIgRN

— Eric (@snootdude) November 15, 2014

Next at #ODCDel14, @SonaMitra on how @CBGAIndia is #usingdata. pic.twitter.com/HU78CVDqCK

— Nasr ul Hadi (@nasrhadi) November 15, 2014

"As researchers we use complicated data, presenting it to a lay audience is difficult but paramount." Sona Mitra from CBGA #ODCDel14

— Aditi Surie (@sosurie) November 15, 2014

Prachi talks about legal aspects of data at #ODCDel14 pic.twitter.com/CfvyJwUsU2

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

simplifying and presenting the complicated govt financial data @CBGAIndia #ODCDel14 pic.twitter.com/l7XwYRqrma

— isha (@ishaparihar) November 15, 2014

https://twitter.com/mtwestra/status/533526864233771009

@mtwestra you should have been here! @akvo is supporting @datameet for #ODCDel14 and its a great event!

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

Praachi Misra and laws we should know for open data #ODCDEL14 pic.twitter.com/r6UOwo353W

— Nisha Thompson (@fakenisha) November 15, 2014

Is attendance monitoring just for managers? Public needs better access to http://t.co/VMGzUJR4Td micro data. #ODCDel14 @EPoDHarvard

— Eric (@snootdude) November 15, 2014

Praachi Misra citing Eastern Book Company & Ors vs D.B. Modak & Anr http://t.co/nzlamEKEXi #odcdel14

— Alex Barth (@lxbarth) November 15, 2014

Legal issues when #usingdata? Competition Commission of India's Praachi Misra gives #ODCDel14 the download. pic.twitter.com/QA8qmS6wat

— Nasr ul Hadi (@nasrhadi) November 15, 2014

https://twitter.com/ZahirKoradia/status/533528993450844161

Facts cannot be copyrighted, so shape files represent facts as boundaries are legally seen as facts? Still some clarity needed #ODCDel14

— Amitangshu| অমিতাংশু। (@amitangshu) November 15, 2014

https://twitter.com/ysprem/status/533530134859374593
https://twitter.com/Sramach9/status/533549017167179776
https://twitter.com/Sramach9/status/533549424761245696

Check out @iotakodali's work to open Indian transit data #ODCDel14 pic.twitter.com/aLuz2h62iZ

— Alex Barth (@lxbarth) November 15, 2014

#odcdel14 more education on data tools among students needed! Yes! #neverhaveieverbeenanundergard @ajantriks pic.twitter.com/11tIS87tpa

— Namrata Mehta (@littlenemrut) November 15, 2014

The data angst is palpable at #ODCDel14. Great flash talks

— Eric (@snootdude) November 15, 2014

Good food at opendatacamp delhi #ODCDel14

— Srinivas Kodali (@digitaldutta) November 15, 2014

https://twitter.com/Sramach9/status/533557403997200384

@lxbarth explaining how openstreetmap and @mapbox works #ODCDel14 pic.twitter.com/KcCdrH1zua

— Srinivas Kodali (@digitaldutta) November 15, 2014

the need for #standards echoing across conversations at #odcdel14

— Namrata Mehta (@littlenemrut) November 15, 2014

Talking @openstreetmap at #odcdel14. @lxbarth @Sramach9. pic.twitter.com/50EnRBcz77

— Sajjad Anwar (@geohacker) November 15, 2014

#odcdel14 approaching the PIN code data problem – thoughts, questions, ideas. Help! http://t.co/zgzkLxDn1V

— Sajjad Anwar (@geohacker) November 15, 2014

Mapbox session hackpad – https://t.co/doXgMN1VtG @datameet #ODCDel14

— Srinivasan Ramani (@vrsrini) November 15, 2014

Closing semicircle! #odcdel14 http://t.co/3D2wsmRz5j

— Sajjad Anwar (@geohacker) November 15, 2014

Closing of #ODCDEL14 wat a great day! Congrats to all @ajantriks and the DataMeet Delhi Chapter u all are amazing! pic.twitter.com/pHz2SGuURO

— Nisha Thompson (@fakenisha) November 15, 2014

https://twitter.com/ayushkray/status/533585062512443392

Met some really interesting folks at the #ODCDel14 today. @rukmini_shrini , @souvikdg @latentappy @s_shashank_s @rungta @ajantriks

— Aditya Dipankar (@adityadipankar) November 15, 2014

That → “@sosurie: Even @DataPortalIndia has problems getting data from govt agencies says Sunil Babbar, NIC. #ODCDel14”

— Souvik Das Gupta (@souvikdg) November 15, 2014

@adityadipankar @rukmini_shrini @souvikdg @s_shashank_s @rungta @ajantriks Ditto. 🙂 wonderful session today. #ODCDel14

— Arpita (she/her) (@latentappy) November 15, 2014

My favourite participant at #ODCDel14 today? Data Dog pic.twitter.com/uKrvcdkhpp

— Rukmini S (@Rukmini) November 15, 2014

https://twitter.com/rohithjyo/status/533628393678319616

TODAY: #OpenData Camp in Delhi #ODCDEL14 w/presentations by @AccInitiative @rukmini_shrini and EPoD's @snootdude & @Ravi_Suhag

— Evidence for Policy Design (@EPoDHarvard) November 15, 2014

EPoD's @snootdude and @Ravi_Suhag giving flash talk on MGNREGA Reports Dashboard at Open Data Camp Delhi #ODCDel14 pic.twitter.com/X1qa5Niixy

— Evidence for Policy Design (@EPoDHarvard) November 15, 2014

View this post on Instagram

A post shared by Thejesh GN ತೇಜೇಶ್ ಜಿ ಎನ್ (@thejeshgn)

A big thank you to @pykih @akvo @AdiantaDOTorg for making #odcdel14 possible.

— datameet (@datameet) November 15, 2014

Bangalore, Data, News

Rebuilding the Karnataka Learning Partnership Platform

October 10, 2014 Sajjad Anwar 1 Comment

The Karnataka Learning Partnership recently launched a new version of their platform. This post talks about why they are building this and also some of the features and details. This is cross-posted from their blog.

Over the past five months we have been busy rearchitecting our infrastructure at Karnataka Learning Partnership. Today, we are launching the beta version of the website and the API that powers most of it. There are still a few rough edges and incomplete features, but we think it is important to release early and get your feedback. We wanted to write this blog post along with the release to give you an overview of what has changed and some of the details of why we think this is a better way of doing it.

Data

We have a semi-federated database architecture. There is data from Akshara, Akshaya Patra, DISE and other partners; geographic data, aggregations and meta-data to help make sense of a lot of this. From our experience PostgreSQL is perhaps the most versatile open-source database management system out there, Especially when we have large amounts of geographic data. As part of this rewrite, we upgraded to PostgreSQL 9.3, which means better performance and new features.

Writing a web application which reads from multiple databases can be a difficult task. The trick is make sure that there is the right amount of cohesiveness. We are using Materialized Views in PostgreSQL. Materialized View is a database object that stores the result of a query in a on-disk table structure. They can be indexed separately and offer higher performance and flexibility compared to ordinary database views. We bring the data in multiple databases together using Materialized Views and refreshing them periodically.

We have a few new datasets – MP/MLA geographic boundaries, PIN code boundaries and aggregations of various parameters for schools.

API

The majority of efforts during the rewrite went into making the API, user interface and experience. We started by writing down some background. The exhaustive list of things that the API can do are here.

We have a fairly strong Python background and it has proven to be sustainable at many levels. Considering the skill-sets of our team and our preference for readable, maintainable code, Django was an obvious choice as our back-end framework. Django is a popular web development framework for Python.

Since we were building a fairly extensive API including user authentication, etc., we quickly realized that it would be useful to use one of the many API frameworks built on top of Django. After some experimentation with a few different frameworks, we settled on using Django-Rest-Framework. Our aim was to build on a clean, RESTful API design, and the paradigms offered by Rest-Framework suited that perfectly. There was a bit of a learning curve to get used to concepts like Serializers, API Views, etc. that Rest-Framework provides, but we feel it has allowed us to accomplish a lot of complex behaviours while maintaining a clean, modular, readable code-base.

Design

For our front-end, we were working with the awesome folks at Uncommon, who provided us gorgeous templates to work with. After lengthy discussions and evaluating various front-end frameworks, we felt none of them quite suited what we were doing, and involved too much overhead. Most front-end frameworks are geared toward making Single Page Apps and while each of our individual pages have a fair amount of complexity, we did not want to convert everything into a giant single page app, as our experience has shown that can quickly lead to spiraling complexity, regardless of the frame-work one uses.

We decided to keep things simple and use basic modular Javascript concepts and techniques to provide a wrapper around the templates that Uncommon had provided and talk to our API to get and post data. This worked out pretty well, allowing us to keep various modules separated, re-use code provided by the design team as much as possible, and not have to spend additional hours and days fighting to fit our code into the conventions of a framework.

All code, design and architecture decisions are in the open, much like how rest of our organisation works. You can see the code and the activity log in our Github account.

Features

For the most part, this beta release attempts to duplicate what we had in v10.0 of the KLP website. However, there are a few new features and few features that have not yet made it through and a number of features and improvements due in future revisions.

Aside from the API, there are a few important new features worth exploring:

The compare feature available at the school and pre-school level. This allows you to compare any two schools or pre-schools.
1. Planned Improvements: The ability to compare at all and any levels of hierarchy; a block to a block or even a block to a district etc.
The volunteer feature allows partner organisations to post volunteer opportunities and events at schools and pre-schools. It also allows users to sign up for such events.
1. Planned Improvements: Richer volunteer and organisation profiles and social sharing options.
The search box on the map now searches through school names, hierarchy (district, block etc.) names, elected representative constituency names and PIN Codes.
1. Planned Improvements: To add neighbourhood and name based location search.
An all new map page powered by our own tile server.
Our raw data page is now powered by APIs and the data is always current unlike our previous version which had static CSV files.
1. Planned Improvements: To add timestamps to the files and to provide more data sources for download.

Now that we have a fairly stable new code base for the KLP website, there are a few features from the old site that we still need to add:

Assessment data and visualisations of class, school and hierarchy performance in learning assessments needs to be added. The reason we have chosen not to add it just yet is because we are modifying our assessment analysis and visualisation methodology to be simpler to understand.
Detail pages for higher levels of aggregation – like a cluster, block and district with information aggregated to that level.
A refresh of the KLP database to bring it up to date with the current academic year. All these three have not been done for the same reason; because this requires an exhaustive refactor of the existing database to support the new assessment schemas and aggregation and comparison logic.

Aside from the three above, we have a few more features that have been designed and written but did not make it in to the current release.

Like the volunteer workflow, we have a donation workflow that allows partner organisations to post donation requirements on behalf of the schools and pre-schools they work with for things these schools and pre-schools require and other in-kind donations. For example, a school might want to set up a computer lab and requires a number of individual items to make it happen. Users can choose to donate either the entire lab or individual items and the partner organisation will help deal with the logistics of the donation.

Our next release is due mid-October to include the volunteer work flow and squish bugs. Post that, we will have a major release in mid-January with the refactored databases and all of the changes that it enables and all the planned improvements listed above. And yes, we do have a mobile application on our minds too.

The DISE application will be updated with the current years data as well by November. We will also add the ability to be able to compare any two schools or hierarchies by December.

So that’s where we are, four years on. The KLP model continues to grow and we now believe we have a robust base on which to rapidly build upon and deploy continuously.

For the record, this is version 11. 🙂

Bangalore, Data, News

Crosspost: Adding stress to a stressed area!

September 22, 2014 Nisha Thompson

A few weeks ago we held an Intro to Data Journalism Workshop. Josephine Joseph was in attendance, she regularly writes for Citizen Matters, Bangalore’s local paper that knows all. She was working on this story and has published it last week with Citizen Matters, I’m very happy to crosspost it here as a great example of local data journalism.

26 projects could: add 19,000 cars to Whitefield traffic, up water demand by 10.5 million litres

East Bangalore area, particularly Whitefield- KR Puram – Mahadevapura area, is on the prime real estate map. What are the projects coming up next? What are the implications?

Investing in real estate in Bangalore is a dream of any investor. However, is the growth of this sector in tune with the infrastructure that the city can handle?

A close look by Citizen Matters at 26 constructions coming up in Whitefield – KR Puram area in East Bengaluru shows some alarming observations. When the 8,000 flats are fully occupied, new residents will need 10,662.87 KL of water a day (equivalent of 1780 water tankers of 6000 Litres). More than 19,697 cars will add to Whitefield traffic.

Ministry of Environment and Forests (MoEF) rules make builders of projects of more than 20,000 sqm built up area, apply for an Environmental Clearance (EC) from the state, along with all the other permissions and NOC from BBMP, BWSSB, Karnataka Ground Water Authority (KGWA) to drill borewells prior to construction commencement.

The State Expert Appraisal Committee (SEAC) receives the applications and recommends checks and balances, prior to recommending a project for EC to the State Environment Impact Assessment Authority (SEIAA).

The SEIAA reviews project details, clarifies issues and only then is the EC issued. In cases where construction has begun without an EC, the builder is served with a show cause notice. The KSPCB can file cases against builders under the Environment Protection Act if they proceed with construction without an EC.

Read the rest over at Citizen Matters.

Great work Josephine!

News

Crosspost: The Hindu’s Rape Statistics Story

August 18, 2014 Nisha Thompson

A few weeks ago The Hindu’s Data Blog had a three part series looking at Data on rape cases in Delhi. It was a powerful story that had a lot of people talking and a good example of what can be done with data available. Rukmini S has written a piece detailing how she combed through the data to get the story.

Below is an excerpt.

How we put together the statistics that went into our investigation

“Delhi is better than most Indian cities for legal data journalism because it puts all district court judgements online – and promptly – and these can be text-searched. Ideally, I should have been able to scrape all judgements for ‘376’, the IPC section related to rape. However, I encountered a ton of issues that would have rendered a scraping tool useless (as far as I know – if you think there was a way I could have done it, do leave me a comment).

For one, while rape cases are sessions-triable, and so should show up as ‘sessions case” in the nomenclature, for some judges the cases were inexplicably classified as “criminal cases”. Then, while a simple text-search for ‘376’ should have been enough to get me all cases, the text-search function inexplicably collapsed around March 2014. With elections coming up, I had limited time to work on this and had to essentially open every single sessions court judgement and search for ‘376’ in each one. Luckily, the search function revived after two months.”

Read the rest here.

News

Missed Revolution? Or Too much Hype?

August 14, 2014 Nisha Thompson

There has been an interesting conversation about this article by Priya Rajasekar “India’s Media — Missing the Data Journalism Revolution?” in Global Investigative Journalism Network.

The thread, which you can find here, is a back and forth about whether India’s journalists are really missing out or if this is a global problem that needs more innovative solutions.

Feel free to add your thoughts to the comments or the thread.

News

Letter to NIC for a data portal to host public contributed datasets

April 4, 2014 Thejesh GN

Sumandro drafted a letter to be sent to NIC regarding the possibility of a data portal to host public contributed datasets, that is datasets originating from both governmental and non-governmental sources, but contributed only by non-governmental agencies and individuals.

We sent that letter to NIC this week. Below is the copy of it.

Letter to NIC for a data portal for public contributed datasets

Growth

Activity

Discussions

Starters

First Responders

Part of many discussions

Topics

Data

API

Design

Features

How we put together the statistics that went into our investigation

DataMeet is a community of Data Science and Open Data enthusiasts.