PROJECTS

What we're working on.

Live tools, datasets, and analyses produced by BioKEA — alongside the forthcoming work our partners have asked us to keep under wraps a little longer.

3 LIVE · 2 REVEALING SOON · 2 COMING SOON
Screenshot of the Intertidal Biodiversity DNA Barcode Library statistics dashboard showing 4,384 species, 15 million records, and coverage breakdowns across BOLD, NCBI, and SRA databases.
LIVE
Interactive Shiny app
2026

Intertidal Biodiversity DNA Barcode Library

with Coastal Quest

A gap-analysis tool for reference-barcode coverage: it cross-references 4,384 intertidal species along the California coast against BOLD, NCBI GenBank, NCBI SRA, and GBIF to prioritize which species to sample next.

  • eDNA
  • marine
  • gap analysis
  • DNA barcoding
  • California
Team
  • ★ Sean
Sampling-effort map of California showing geographic spread of insect specimens barcoded under the California Insect Barcoding Initiative.
REVEALING SOON
Research paper + dataset
2026

California Insect Barcoding Initiative

The first large-scale DNA-barcode survey of California insects: more than 1 million specimens barcoded, yielding a conservative minimum estimate of ~61,000 species statewide, roughly one third of them still undiscovered. The project generates spatial richness interpolations, constrained by ecoregion and vegetation type, to guide targeted inventory and conservation.

Independent origin · Originated independently of BioKEA as Austin's prior research program; now continued under the BioKEA umbrella.
  • DNA barcoding
  • insects
  • biodiversity
  • California
  • conservation
Team
  • ★ Austin

Target reveal · Pending Ecography publication

DaKineDiving interactive map of O'ahu showing dive sites, real-time conditions, and marine biodiversity overlays.
LIVE
Web application
2025

DaKineDiving — real-time dive intelligence for O'ahu

🏆 Built with Claude Sonnet 4.5 Challenge — Winner

A real-time dive intelligence platform for O'ahu, Hawai'i. Combines NOAA tide data, PacIOOS wave buoys, and GBIF biodiversity records to surface conditions, encounter probabilities for 100+ marine species, and Marine Life Conservation District boundaries on an interactive map. Built with Claude Sonnet 4.5.

Independent origin · Built by Sean as a solo entry to Anthropic's Built with Claude Sonnet 4.5 Challenge (October 2025); winner of the contest. Surfaced under BioKEA because of the GBIF biodiversity layer; not part of the BioKEA wet-lab pipeline.
  • marine
  • biodiversity
  • GBIF
  • eDNA-adjacent
  • Hawai'i
  • AI-assisted build
Team
  • ★ Sean
Map of the San Francisco Bay-Delta estuary showing eDNA sampling stations for the metabarcoding baseline.
REVEALING SOON
Dataset + paper
2026

Bay estuary metabarcoding baseline

with San Francisco Estuary Institute

A longitudinal metabarcoding baseline for the San Francisco Bay estuary, in partnership with the San Francisco Estuary Institute.

  • metabarcoding
  • marine
  • estuary
  • Bay Area
Team
  • ★ Michelle
  • Sean

Target reveal · Q4 2026

Visualization of a long-read microbial genome assembly produced on the ONT PromethION 2.
COMING SOON
Dataset
2026–2027

Long-read microbial genome resource

A growing library of high-quality long-read microbial assemblies produced on the ONT PromethION 2 and published as a public resource.

  • sequencing
  • microbial
  • long-read
Team
  • ★ Sean

Target reveal · Q4 2026

Colloquip multi-agent deliberation interface showing specialized scientific personas debating a hypothesis.
LIVE
Open-source platform
2026

Colloquip — multi-agent scientific deliberation

An open-source multi-agent AI deliberation platform. Specialized scientific personas — Biology, Chemistry, ADMET, Clinical, Regulatory, Red Team — self-organize to debate hypotheses, with emergent discussion phases and energy-based conclusion instead of fixed turn orders.

Independent origin · Originated and maintained independently by Sunit Jain on GitHub; surfaced here through Sunit's advisor role, not authored by BioKEA.
  • AI
  • multi-agent
  • scientific reasoning
  • open source
  • deliberation
Team
  • ★ Sunit
Sequoia title slide — a redwood canopy seen from below, with a stylized golden tree-ring icon at the center, and the words "Introducing Sequoia: BioKEA's Foundation Model for Global Biodiversity. A Living Brain for the Natural World."
COMING SOON
AI foundation model
2026–2027

Sequoia™ — a foundation model for global biodiversity

BioKEA's multimodal foundation model: it learns biology from DNA, images, and the spatial environment all at once — what an organism is, what it looks like, and where it lives. Reads partial DNA, fragmentary photographs, and habitat maps as one signal. Two tiers: the Seed (a small, single-file model that runs on a laptop, for education and open collaboration) and the Forest (the production engine that processes hundreds of millions of biodiversity datapoints).

  • AI
  • foundation model
  • multimodal
  • biodiversity
  • computer vision
  • DNA
Team
  • ★ Sean

Target reveal · 2027

Have a project that belongs here?

We're taking collaboration, contract, and funding conversations.

Start a conversation →