Story

I started working on Ruby and Ruby on Rails in the first year of engineering and by the end of second year, I was selected for the Rails Girls Summer of Code, a program which accepts 20 teams all over the world (2 students per team) each year and provides them with an open-source project to work on. It is a global fellowship program for women and non-binary coders.

As a part of this program, I had the opportunity to deliver a talk at the dotJS conference 2017 in Paris. My talk was on my work during the summer and a short description of the RGSoC initiative.

Hackathons are one of my favourite places to be in. The energy, the buzz and the chance to work with an extremely unique and driven set of people to create a prototype are what gets me excited. All through my undergraduate, I attended numerous hackathons, some within India while some international.

The first hackathon I attended was the APNIC Hackathon in Nepal. APNIC(Asia Pacific Network Information Center) is the Regional Internet Registry administering IP addresses for the Asia Pacific. The aim of APNIC 45 was to promote the spread of IPv6. Although it has been more than 20 years since the ‘new’ version of the Internet Protocol was introduced by the IETF, it has not been fully deployed yet. Different metrics show that many networks are already using IPv6 but still there is work to do.

Shout out to the 8 female participants of APNIC's first #Hackathon. They spent some time with Sylvia from @ApnicFoundation & @RIPE_NCC 's Vesna. Brilliant minds all of you! #womenintech #APRICOT2018 pic.twitter.com/8u6rThuAAQ
— APNIC (@apnic) February 25, 2018

A total of 25 hackers were chosen from the Asia-Pacific region. It was an honour to be selected to attend this hackathon.

Around this time, I was awarded the WeTech Qualcomm Global Scholarship. This program is a partnership between the Institution of International Education and Qualcomm that invests in young women and future leaders in STEM (Science, Technology, Engineering and Math).

Through the four years of my undergraduate, I was also involved in programs that help guide women into tech. One of those initiatives was by the Rethink initiative. A program I participated in as a coach was the WIT Learning Program.

After completing a bachelors in engineering, I worked at Bang the Table, a startup focussed on building a product that encourages community engagement. Bang the Table would later be acquired by Granicus. My role primarily involved a lot of Rails.

In fall of 2021, I will be a part of the Erasmus-Mundus LCT program as an EMJMD scholar, completing the first year at Charles University, Prague and the second year at Saarland University, Germany. Wish me luck !

Apparently, luck was on my side — the program went well, and the thesis was awarded an excellent grade at Charles University.

I wrote my master’s thesis on unsupervised Open Information Extraction with LLMs (PDF), supervised by Pavel Pecina at Charles University and Günter Neumann at Saarland University. The starting point was DeepEx, which generates candidate triples from the attention matrix of a pretrained language model and then ranks them with a contrastive model trained on millions of Wikipedia-derived sentence-triple pairs. The generation step produces too many candidate triples; the ranker is what accounts for the system’s performance. My question was whether the ranker really needs that much data. Could one trained far more simply, on far less data, do the same job?

The intuition I tested: English is an SVO language, so if you take a subject-predicate-object triple and simply join it, you get a clause. If the triple is a good extraction, that clause should read as a sentence a native speaker would accept. So instead of contrastive ranking, I ranked triples by linguistic acceptability, using BERT fine-tuned on acceptability corpora like CoLA and MegaAcceptability, and probability-based acceptability measures from the sentence acceptability literature to correct for confounds like sentence length and lexical frequency. The substitution held up. With orders of magnitude less ranking data, the system matched or beat DeepEx on most standard OpenIE benchmarks. The ablation studies threw up some interesting results of their own, you can check them out here

Given all that, my next move might seem odd: I became a Site Reliability Engineer. It was not a rejection of NLP. I genuinely love the field. I picked a master’s programme built entirely around it, and the thesis was some of the most fun I have had as an engineer. But the time I spent working with language models also taught me something about my own temperament. Empirical NLP research runs largely on benchmark results: a method works, the delta is real, and the paper offers a hypothesis for why. Testing that hypothesis is usually a separate research programme in itself, so most explanations stay at the level of a plausible mechanism. Ablations tell you which component matters, not why it matters. That is a reasonable way for a field to operate, but it means the “why” behind your own results is often open-ended. Industry does not fix this; it goes further in the same direction. Applied NLP teams work with offline evals and A/B tests. If the metric gets better, the model ships. On a typical product team, understanding why the model works is not what you are paid for, so the model stays a component you measure and monitor rather than one you understand. Infrastructure is not like that. When something breaks, there is a concrete cause: a config change, a saturated resource, a race condition. Sometimes you never find it because the state is gone, but the cause exists, and finding it is the actual job. I prefer that. I like systems where failures have causes