top of page
TEAM
Natural Language Query Analysis for Genomic Data
Kashvi Srivastava

Objective: Build a pipeline to query genomic information using natural language queries. For instance, "Find genes associated with Alzheimer’s disease" is a natural language query.
Methodology:
• Data Processing: Clean and process genomic datasets such as UCSC Genome Browser
• Feature Engineering: Generate feature embeddings for natural language queries
• Neural Network Modeling: Fine-tune a pre-trained LLM using the genomic dataset
• Additional Step: Create an interface for the queries








bottom of page