PROJECTS
What we're working on.
Live tools, datasets, and analyses produced by BioKEA — alongside the forthcoming work our
partners have asked us to keep under wraps a little longer.
3 LIVE
· 2 REVEALING SOON
· 2 COMING SOON
LIVE Interactive Shiny app
2026
Intertidal Biodiversity DNA Barcode Library
with Coastal Quest
A reference barcode taxonomic coverage gap analysis tool — 4,384 intertidal species along the California coast, cross-referenced against BOLD, NCBI GenBank, NCBI SRA, and GBIF to prioritize which species to sample next.
- eDNA
- marine
- gap analysis
- DNA barcoding
- California
REVEALING SOON Research paper + dataset
2026
California Insect Barcoding Initiative
The first large-scale DNA-barcode survey of California insects — over 1 million specimens barcoded, estimating a conservative minimum of ~61,000 species statewide with roughly one third still undiscovered. Generates spatial richness interpolations constrained by ecoregion and vegetation type to guide targeted inventory and conservation.
Independent origin
Originated independently of BioKEA as Austin's prior research program; now continued under the BioKEA umbrella.
- DNA barcoding
- insects
- biodiversity
- California
- conservation
Target reveal · Pending Ecography publication
DaKineDiving — real-time dive intelligence for O'ahu
🏆 Built with Claude Sonnet 4.5 Challenge — Winner A real-time dive intelligence platform for O'ahu, Hawai'i. Combines NOAA tide data, PacIOOS wave buoys, and GBIF biodiversity records to surface conditions, encounter probabilities for 100+ marine species, and Marine Life Conservation District boundaries on an interactive map. Built with Claude Sonnet 4.5.
Independent origin
Built by Sean as a solo entry to Anthropic's Built with Claude Sonnet 4.5 Challenge (October 2025); winner of the contest. Surfaced under BioKEA because of the GBIF biodiversity layer; not part of the BioKEA wet-lab pipeline.
- marine
- biodiversity
- GBIF
- eDNA-adjacent
- Hawaii
- AI-assisted build
REVEALING SOON Bay estuary metabarcoding baseline
with San Francisco Estuary Institute
A longitudinal metabarcoding baseline for the San Francisco Bay estuary, in partnership with the San Francisco Estuary Institute.
- metabarcoding
- marine
- estuary
- Bay Area
Coming soon
Target · Q4 2026
COMING SOON Long-read microbial genome resource
A growing library of high-quality long-read microbial assemblies produced on the ONT Promethion 2 and published as a public resource.
- sequencing
- microbial
- long-read
LIVE Open-source platform
2026
Colloquip — multi-agent scientific deliberation
An open-source multi-agent AI deliberation platform. Specialized scientific personas — Biology, Chemistry, ADMET, Clinical, Regulatory, Red Team — self-organize to debate hypotheses, with emergent discussion phases and energy-based conclusion instead of fixed turn orders.
Independent origin
Originated and maintained independently by Sunit Jain on GitHub; surfaced here through Sunit's advisor role, not authored by BioKEA.
- AI
- multi-agent
- scientific reasoning
- open source
- deliberation
Coming soon
Target · 2027
COMING SOON AI foundation model
2026–2027
Sequoia™ — a foundation model for global biodiversity
BioKEA's multimodal foundation model: it learns biology from DNA, images, and the spatial environment all at once — what an organism is, what it looks like, and where it lives. Reads partial DNA, fragmentary photographs, and habitat maps as one signal. Two tiers: the Seed (a small, single-file model that runs on a laptop, for education and open collaboration) and the Forest (the production engine that processes hundreds of millions of biodiversity datapoints).
- AI
- foundation model
- multimodal
- biodiversity
- computer vision
- DNA
Have a project that belongs here?
We're taking collaboration, contract, and funding conversations.
Start a conversation →