top of page

Zeyneb N. Kaya

Hi! I am Zeyneb, a student at Stanford University. My interests are in natural language processing, towards furthering diversity and understanding. This page is a collection of pursuits and fun projects I've taken on. 

Apart from learning about interesting new languages and exploring any form of data I encounter, I like listening to & playing music, collecting cute items & keychains, and videography.

Education.

Stanford University 2024-Present

N/A GPA

Saratoga High School 2020-2024

AI Club Co-President, Linguistics Club Founder & President, Chinese Club Officer 

 

4.0 UW / 4.5 W (10-12) GPA

West Valley Community College

4.0 GPA

Dual Enrollment: Differential Equations, Linear Algebra, Multivariable Calculus, Cultural Anthropology

Research.

My research interests are in natural language processing, linguistics, and data science. As the foundation of inclusivity and understanding, I focus on furthering effective communication. I have worked on impact and community-oriented applications of NLP in the social sciences, working with low-resource languages, bias, and conversational systems. Listed below are selected relevant publications.  

Vector Space Distance as a Measurement of Word Embedding Variability in Low-Resource Linguistic Environments

Zeyneb N. Kaya, Annie K. Lamar

Under Review, The North American Chapter of the Association for Computational Linguistics, 2025

Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions

Zeyneb N. Kaya, Souvick Ghosh

arXiv

Zeyneb N. Kaya, Annie K. Lamar

Proceedings of the Sixth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT) @ EACL, 2023

Full Scope Word Embedding Variability for Low-Resource Languages

Zeyneb N. Kaya, Annie K. Lamar

IEEE MIT Undergraduate Research and Technology Conference, 2023

MADLIBS: A Novel Multilingual Data Augmentation Algorithm for Low-Resource Neural Machine Translation 

Zeyneb N. Kaya

Regeneron Science Talent Search, 2023-2024

Zeyneb N. Kaya

Proceedings of the Linguistic Society of America (PLSA), 2023

Zeyneb N. Kaya

International Conference on Computational Social Science (IC2S2), 2023

Zeyneb N. Kaya, Manya Sriram

University of California, Santa Barbara, 2022

Honors.

Selected Honors

National Award Winner + Regional Affiliate, 2023

Congressional App Challenge Winner

2021

Stanford Women in Data Science (WiDS) Datathon

Top High School Winner, 2023

Technovation Global Challenge

Semifinalist, 2021

VIP Invitee, 2024

USACO

Silver, 2020

US Presidential Scholars Semifinalist

2024

10th Place, 1st in USA, 2023

Scholastic Art and Writing Competition

Honorable Mention, 2020

TeenInk

Editor's Choice Award, 2022

AP Scholar with Distinction

2023

North American Computational Linguistics Olympiad (NACLO)

Invitational Round Qualifier, 2023

Junior Science and Humanities Symposium (JSHS)  

National Qualifier (Top 5),

2nd Math/CS, 2023

Synopsys Science Fair

1st Award, CSEF Qualifier, 2023

Natural Language Processing Specialization

DeepLearning.AI, 2021

Finalist, 2023

National Merit Scholarship Finalist

2023

Bausch and Lomb Honorary Science Award, University of Rochester

Saratoga High School Junior Awards Ceremony, 2023

Saratoga SMASH'N

5x Nominee, 2022-2023

Writing.

I view language as a medium for influence. Through writing, I want to challenge our views and share new perspectives and stories. My work has been published in various international literary journals and platforms. 

Books.

front.png

Everlasting Connection:

Language, Time, Society, and Technology

Zeyneb N. Kaya

Breaking Barriers:

Celebrating Women and Diversity in Data Science

Zeyneb N. Kaya & Shivani Mudhol

Wids Book (6 x 9 in) (1).jpg

Amazon #1 New Release, STEM Education

Romeyka

Everlasting

Romeyka is a Greek dialect spoken in regions of Turkey near the Black Sea, and is a form of Pontic Greek which shows features in common with Ancient Greek that are distinct from other dialects of the language.

 

Romeyka Everlasting  is devoted to preserving Romeyka and its heritage, using technologies in computational linguistics to document, promote, and research the language. In rediscovering Romeyka, Romeyka Everlasting brings to light the unheard experiences of the community and fosters the continuity of its traditions for posterity. 

Screen Shot 2023-04-02 at 12.11.44 AM.png

Our impact

Gallery.

bottom of page