I'm a Computer Science Ph.D. student at Stanford University advised by Christopher Ré. My research interests lie at the intersection of machine learning and systems. Recently, I have been excited about learning embeddings of data, including words, source code, and knowledge graph entities. I graduated with a BS in Electrical & Computer Engineering and a BS in Computer Science from Cornell University. At Cornell, I had the opportunity to do research in computer architecture with Christopher Batten. I am supported by the National Science Foundation Graduate Research Fellowship and the Stanford EDGE Fellowship.
[10/20/20] Our paper on Bootleg is accepted to CIDR 2021.
[09/21/20] Excited to be interning at Microsoft's applied research lab for Azure Data this fall!
[03/02/20] Presented our work on Embedding Stability at MLSys in Austin.
Bootleg, a self-supervised named entity disambiguation (NED) system for the tail. Bootleg improves by over 50 F1 points over a BERT-based NED baseline on disambiguating tail (i.e., rarely seen) entities in Wikipedia, while achieving state-of-the-art performance on standard, sentence-level NED benchmarks. [website] [code]
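To make the task concrete, here is a toy sketch of what NED looks like, with a word-overlap scorer standing in for a learned model. The mention, candidates, and descriptions are invented for illustration and this is not Bootleg's API:

```python
def disambiguate(context_tokens, candidates, descriptions):
    """Toy scorer: pick the candidate whose description overlaps the context most.

    Real NED models (Bootleg included) use learned representations, not word
    overlap; this only illustrates the input/output shape of the task.
    """
    def score(cand):
        return len(set(context_tokens) & set(descriptions[cand].split()))
    return max(candidates, key=score)

context = "Lincoln was a popular luxury car in the 1960s".split()
candidates = ["Abraham Lincoln", "Lincoln, Nebraska", "Lincoln Motor Company"]
descriptions = {
    "Abraham Lincoln": "16th president of the United States",
    "Lincoln, Nebraska": "capital city of the state of Nebraska",
    "Lincoln Motor Company": "luxury car brand owned by Ford",
}
print(disambiguate(context, candidates, descriptions))  # Lincoln Motor Company
```

Tail entities like the car brand above appear rarely (or never) in training data, which is where memorization-based baselines struggle.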
Embedding Stability, a study of how word embedding memory affects the downstream instability of NLP tasks. Embeddings must be continually retrained as the underlying data changes. However, embedding training is inherently unstable: small changes in the training data can produce dramatically different embeddings. We explore measures to estimate when this instability will propagate to downstream NLP tasks. [pdf] [slides]
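As a rough illustration, downstream instability can be quantified as prediction disagreement: train the same downstream model on two versions of the embeddings and count how often their test predictions differ. A minimal sketch, with names of my choosing rather than the paper's code:

```python
import numpy as np

def prediction_disagreement(preds_a, preds_b):
    """Fraction of test examples on which two downstream models disagree.

    A simple proxy for downstream instability: train the same task model on
    embeddings from two consecutive training runs and compare predictions.
    """
    preds_a, preds_b = np.asarray(preds_a), np.asarray(preds_b)
    assert preds_a.shape == preds_b.shape
    return float(np.mean(preds_a != preds_b))

# Example: predictions from models trained on embedding versions v1 vs. v2
preds_v1 = [1, 0, 1, 1, 0]
preds_v2 = [1, 1, 1, 0, 0]
print(prediction_disagreement(preds_v1, preds_v2))  # 0.4
```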
High-Accuracy Low-Precision Training (HALP), a stochastic gradient descent variant that provably converges to high-accuracy solutions while using low-precision arithmetic. We empirically validated HALP on linear and logistic regression problems, as well as on LSTMs and CNNs. [blog] [pdf] [slides]
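A minimal sketch of the idea behind HALP, assuming a least-squares objective: an SVRG-style outer loop computes a full-precision gradient and re-centers and re-scales the low-precision representation around the current iterate ("bit centering"), so a fixed number of bits covers a shrinking region as training converges. The names and the scale formula below are my paraphrase of the approach, not the paper's implementation:

```python
import numpy as np

def quantize(x, scale, bits=8):
    """Round to the nearest representable fixed-point value and saturate."""
    levels = 2 ** (bits - 1)
    return np.clip(np.round(x / scale), -levels, levels - 1) * scale

def halp_sketch(X, y, lr=0.1, epochs=20, inner=50, bits=8, mu=1.0):
    """Illustrative HALP-style loop for least squares (not the paper's code)."""
    n, d = X.shape
    w_tilde = np.zeros(d)                              # full-precision snapshot
    for _ in range(epochs):
        g_full = X.T @ (X @ w_tilde - y) / n           # full gradient, high precision
        # Dynamic rescaling: the representable box shrinks with the gradient norm.
        scale = max(np.linalg.norm(g_full) / (mu * 2 ** (bits - 1)), 1e-12)
        z = np.zeros(d)                                # low-precision offset from snapshot
        for _ in range(inner):
            i = np.random.randint(n)
            w = w_tilde + z
            # SVRG variance-reduced stochastic gradient.
            g = X[i] * (X[i] @ w - y[i]) - X[i] * (X[i] @ w_tilde - y[i]) + g_full
            z = quantize(z - lr * g, scale, bits)      # update stays low precision
        w_tilde = w_tilde + z                          # re-center for the next epoch
    return w_tilde
```

The key design choice is that as the snapshot approaches the optimum, the scale shrinks, so the same bit budget represents the remaining distance with ever finer resolution; this is what lets a low-precision inner loop reach high-accuracy solutions.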
Publications and Preprints
Teaching Experience
Cornell University
- ECE 4750: Computer Architecture, Undergraduate Teaching Assistant (Fall 2016)
- CS 1110: Introduction to Python, Consultant (Fall 2014, Spring 2015, Fall 2015, Spring 2016)