DISCLAIMER: Before you begin, be aware that the protagonist of any fictional account that this write-up might contain would be a male and would be referred to as a ‘he’ or at times, simply as ‘coder’. That’s not because the … Continue reading Open Source: The itch, the hustle and the merge
I recently participated in a weekend-long data science hackathon, titled ‘The Smart Recruits’. Organized by the amazing folks at Analytics Vidhya, it saw some serious competition. Although my performance can be classified as decent at best (47 out of 379 participants), it was among the more satisfying ones I have participated in on both AV (profile) and Kaggle (profile) over the last few months. Thus, I decided it might be worthwhile to try and share some insights as a data science autodidact. The problem The competition required us to use historical data to create a model to help an organization … Continue reading Data Science Competitions 101: Anatomy and Approach
tl;dr Code for scraping GSoC data from Google Developers website for the years 2005-08 and storing it as csv files can be found here: GSoC data digging on Github Two csv files will be created: org_numbers.csv (containing information about the … Continue reading Digging into GSoC Data: Scraping 101 (Part 1)
For the last two years or so, I have been involved with Python in an on-off relationship of sorts. I possessed syntactical familiarity, had completed a MOOC and couple of web projects using Django. I had also read up on … Continue reading Python in the open!