Skip to content

Imageomics: Bringing machine learning to life.

The Imageomics Institute GitHub organization hosts the development and distribution of a collection of open-source ML tools used to study the biological information encoded in images and videos integrated with structured biological knowledge. All Imageomics code, data, models, and demos hosted on GitHub or Hugging Face are searchable through the Imageomics Catalog:

Screenshot of Imageomics Catalog, organized by most likes/stars

What is the Imageomics Institute?

The Imageomics Institute is funded by the US National Science Foundation's Harnessing the Data Revolution (HDR) program under Award #2118240 (Imageomics: A New Frontier of Biological Information Powered by Knowledge-Guided Machine Learning). It started in Oct 2021.

You can find a full mission, vision, and abstract under the Imageomics website's About page. In short, the vision of the Institute is to "establish a new scientific field called imageomics that harnesses revolutions in data science and computing, as well as the rapidly expanding collections of biological image data, in order to accelerate biological understanding of phenotypic traits extracted from images of organisms."

History

The inception and research of the Imageomics Institute builds heavily on the "Biology-Guided Neural Networks for Discovering Phenotypic Traits" (BGNN) project, also funded by the US National Science Foundation. BGNN itself built in part on the Phenoscape project (funded by NSF multiple times), which started in 2007 and was incubated at the NSF-funded National Evolutionary Synthesis Center (NESCent).

Code repositories overview

Due to the history (see above) and highly collaborative and cross-disciplinary nature of the Institute, important software products and other code repositories are distributed over several organizations in GitHub, in addition to the ones found here. The following gives an overview and useful links.

Imageomics Institute

Institute collaborators


Disclaimer: Any opinions, findings and conclusions or recommendations expressed in the materials here are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Pinned Loading

  1. bioclip-2 bioclip-2 Public

    Repository for the BioCLIP 2 model project. [NeurIPS'25 Spotlight] BioCLIP 2 is a biological foundation model trained on TreeOfLife-200M. Despite the narrow training objective, BioCLIP 2 yields ext…

    Python 53 9

  2. Collaborative-distributed-science-guide Collaborative-distributed-science-guide Public template

    Template guide to collaborative work, including GitHub and Hugging Face workflows. Co-developed with the Imageomics Guide.

    Python 1

  3. emb-explorer emb-explorer Public

    An interactive tool for classifying images with a pretrained model and exploring clustering results in 2D space.

    Python 1

  4. pybioclip pybioclip Public

    Python package and CLI that simplifies using the BioCLIP family of foundation models.

    Python 59 11

  5. SST SST Public

    Official repo for paper "Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation". Ideal for specimen image trait segmentation.

    Jupyter Notebook 10 3

  6. TaxonoPy TaxonoPy Public

    A Python package for efficiently aligning organismal taxonomic hierarchies using the Global Names Verifier

    Python 13

Repositories

Showing 10 of 110 repositories
  • catalog Public template

    Catalog app for Imageomics code, data, models, and spaces hosted on GitHub and Hugging Face. Designed for improved discoverability of Imageomics products, the content is updated in real-time through calls to the GitHub and Hugging Face APIs. This repository also serves as a template for others looking to create a similar website.

    Imageomics/catalog’s past year of commit activity
    JavaScript 6 MIT 0 6 1 Updated Mar 12, 2026
  • biobench Public

    Computer vision benchmark for evolutionary biology-related tasks.

    Imageomics/biobench’s past year of commit activity
    Python 10 MIT 2 1 1 Updated Mar 12, 2026
  • bioclip-ecosystem Public

    Centralized resource to describe and link out to all components of the BioCLIP project (Code, Models, Datasets, Spaces).

    Imageomics/bioclip-ecosystem’s past year of commit activity
    HTML 0 MIT 0 5 1 Updated Mar 12, 2026
  • saev-fish-dev Public Forked from OSU-NLP-Group/saev

    SAEs for dichotomous keys with fish

    Imageomics/saev-fish-dev’s past year of commit activity
    Python 0 MIT 7 0 1 Updated Mar 12, 2026
  • dna-trait-analysis Public

    Goal: to find associations between dna data and visual traits.

    Imageomics/dna-trait-analysis’s past year of commit activity
    Jupyter Notebook 4 MIT 1 7 (3 issues need help) 2 Updated Mar 12, 2026
  • 2026-02-09-osu-online Public

    Software Carpentry Workshop on Version Control with Git and R for Scientific Reproducibility

    Imageomics/2026-02-09-osu-online’s past year of commit activity
    HTML 0 0 0 0 Updated Mar 11, 2026
  • Imageomics-guide Public

    Imageomics-focused guide to collaborative work, including GitHub and Hugging Face workflows.

    Imageomics/Imageomics-guide’s past year of commit activity
    Python 4 CC0-1.0 0 7 (1 issue needs help) 1 Updated Mar 11, 2026
  • bioclip-2 Public

    Repository for the BioCLIP 2 model project. [NeurIPS'25 Spotlight] BioCLIP 2 is a biological foundation model trained on TreeOfLife-200M. Despite the narrow training objective, BioCLIP 2 yields extraordinary accuracy when applied to various biological visual tasks such as habitat classification and trait prediction.

    Imageomics/bioclip-2’s past year of commit activity
    Python 53 9 2 1 Updated Mar 11, 2026
  • wing-segmentation Public

    Butterfly wing segmentation

    Imageomics/wing-segmentation’s past year of commit activity
    Jupyter Notebook 4 MIT 2 3 3 Updated Mar 10, 2026
  • bioclip-vector-db Public

    Vector database training data retrieval of BioCLIP-embedded nearest neighbors.

    Imageomics/bioclip-vector-db’s past year of commit activity
    Python 0 MIT 1 11 4 Updated Mar 9, 2026