Ethiopian Coffee Genetic Databases

Comprehensive repository of coffee genetic resources, germplasm collections, biochemical data, and molecular markers from Ethiopia's center of origin.

5,238 Coffee Accessions (EBI)
738 Harar Accessions
101 East Wollega Accessions
185 SRA Experiments (NCBI)

Sources: Ethiopian Biodiversity Institute [1], EIAR Datahub [2], NCBI [6], DergiPark 2025 [3][7]

The Global Importance of Ethiopian Coffee Genetic Resources

Ethiopia is the center of origin and diversity for Coffea arabica, harboring the primary gene pool for global coffee genetic improvement [3][7].

From the coffee forests of southwestern Ethiopia to the Harar highlands, diverse landraces and wild populations represent an invaluable genetic resource for breeding climate-resilient, disease-resistant, and high-quality coffee varieties. Recent molecular studies using RAD-seq and SSR markers have revealed new dimensions of this diversity, including previously undocumented populations in South Sudan that expand the known genetic base [8].

"Wide C. arabica genetic study brings new insights on movements and breeding history of the species. For the first time, wild C. arabica populations of South Sudan are shown to bring new genetic diversity as compared to Ethiopian wild arabicas. A structuration of the Ethiopian accessions surveyed in the 60's (FAO and Orstom) is unraveled. The traditional Bourbon/Typica varieties are genetically related to the Ethiopian cluster east of the Ethiopian coffee area."

— Christophe Montagnon, World Coffee Research [8]

Key Database Types

  • Field Gene Banks
  • Biochemical Data
  • Molecular Markers
  • Georeferenced Accessions
  • Morphological Descriptors

Ethiopian Biodiversity Institute (EBI) Field Gene Banks

The EBI maintains two major field gene banks for coffee genetic resources conservation [1].

Choche Field Gene Bank
5,238 Accessions

Location: Oromia Region, Jimma Zone, Goma Woreda near Agaro

Elevation: 1,600 masl

Coordinates: 7°54' N, 36°39' E

Area: 21 hectares

Established: Two decades ago primarily for coffee genetic resources

Current status: 12 species of horticultural genetic resources conserved [1]

Bedessa Field Gene Bank
738 Accessions

Location: Oromia Region, West Harerghe Zone, Kuni Woreda near Bedessa

Elevation: Not specified

Coordinates: 8°53' N, 40°46' E

Area: 7.8 hectares

Established: 2002 primarily for Harar coffee germplasm

Collection: Collected from Harar coffee growing areas

Harar Coffee Conservation Challenge

Threat: Coffee Berry Disease (CBD) is more pronounced in Harerghe area, leading to declining farmer interest [1]

Land use shift: Farmers switching to chat production

EBI response: "Compared to the size of damage incurred on Harar coffee genetic resources, however, the present holding is not adequate and hence, further collection missions should be initiated in areas that have not covered in the previous collection missions." [1]

Recalcitrant seeds: Coffee seeds cannot tolerate desiccation and low temperatures, making field gene banks essential for ex-situ conservation [1].

Source: Ethiopian Biodiversity Institute, Horticulture Field Gene Banks [1]

EIAR Coffee Research Data

East Wollega Biochemical Database
101 Accessions + 4 Checks

Study: Genetic Variability of Ethiopian Coffee Accessions for Bean Biochemical Constituents [2]

Published: 2023, EIAR Datahub

DOI: 10.20372/eiar-rdm/WY0SB1

Biochemical Traits Analyzed:
  • Trigonelline
  • Total chlorogenic acid
  • Caffeine
  • Crude protein
  • Crude fat
  • Crude ash
  • Dry matter content
Key Findings:
  • Significant differences (P<0.05) for all traits except dry matter
  • First four PCs accounted for 96.9% of total variability
  • PC1 (47.9%): Total chlorogenic acid and crude fat variation
  • Six distinct clusters + two solitary accessions
Access Dataset
Bench Maji Coffee Quality Database

Dataset: Coffee Quality Profile Mapping of BenchMaji [9]

Published: December 2023, Ethiopian National Agri Data Hub

Description: Southwestern Ethiopia is the origin of Arabica coffee, possessing the largest diversity in coffee genetic resources [9]

Data includes: Quality profiles, cup characteristics

Access Dataset

"Variability for coffee bean biochemical composition among the coffee accessions is vital for further quality improvement. However, lack of this information has been one of the major bottlenecks for any coffee quality improvement program."

— WeldeMichael Getachew et al., EIAR [2]

Yirgacheffe Germplasm Characterization (2025)

Recent morphological diversity study of 26 Coffea arabica landrace germplasms from Yirgacheffe district [3][7].

Study Parameters

  • Location: Yirgacheffe district, Gedeo zone
  • Study period: 2022-2023
  • Accessions: 26 landraces + 2 standard checks
  • Design: Randomized complete design at Wanago Tumata Chiracha nursery

Quantitative Traits Measured

  • Seedling height
  • Number of paired leaves
  • Leaf length and width
  • Leaf area
  • Petiole length
  • Node number
  • Internode length
  • Stem diameter

Result: Significant variations (p<0.05) between and within accessions [3][7]

Qualitative Traits & Diversity Index

Young leaf color H = 1.414
Leaf shape H = 1.067
Leaf apex shape H = 0.908
Young shoot color H = 0.582
Leaf petiole color H = 0.429

Cluster Distribution

  • Cluster I: 9 accessions
  • Cluster II: 15 accessions (maximum)
  • Cluster III: 4 accessions

"Coffea Arabica landraces germplasm having high seedling height, leaf length, number of paired leaves and leaf area should get emphasis during selection for plantation."

— Hajı et al., 2025 [3][7]
Conservation imperative: "Every concerned body, such as breeders, farmers, and genetic conservationists should take action to conserve and keep the gene pool of these coffees since it paved the way for biotechnologists to characterize coffee at the molecular level and breeders consider it to release superior new coffee varieties." [3][7]

Source: International Journal of Life Sciences and Biotechnology, 2025 [3][7]

Molecular & Genomic Databases

NCBI BioProject PRJNA1309331
185 Accessions
267 Gbases

Title: Coffee Germplasm Collection in China Revealed by RAD-seq [6]

Submission: August 2025, Yunnan Dehong Institute of Tropical Agricultural Science

Data type: Raw sequence reads

Accession: PRJNA1309331

Germplasm Composition:
  • 185 accessions of Coffea arabica
  • Maintained at Chinese Germplasm Repository of Coffee, Ruili City
  • Three principal genetic groups: Bourbon/Typica, Ethiopian native, Introgression group
  • Collected from: Kenya, Burundi, Côte d'Ivoire, Colombia, Ethiopia, India, Portugal

Data volume: 267 Gbases, 95,119 Mbytes [6]

View NCBI Record
World Coffee Research Database

Global genetic study of C. arabica including [8]:

  • Core collection (2014): Ethiopian accessions (FAO and ORSTOM surveys)
  • Wild arabicas from South Sudan (2014 survey)
  • Large representation of cultivated varieties worldwide
  • 2,000+ entries genotyped with 9 SSR markers
Key Findings:
  • South Sudanese populations bring new genetic diversity
  • Structuration of 1960s Ethiopian accessions revealed
  • Bourbon/Typica varieties related to Ethiopian cluster east of coffee area
  • Most varieties show residual segregation, not fully fixed [8]
WCR Varieties Catalog

Molecular Markers Available

SSR Markers:
  • 9 markers used in WCR global study [8]
  • High polymorphism in Ethiopian germplasm
RAD-seq:
  • 185 accessions sequenced [6]
  • 267 Gbases raw sequence data
Applications:
  • Variety authentication
  • Genetic diversity assessment
  • Population structure

Coffee Biochemical Composition Database

Comprehensive biochemical data from Ethiopian coffee accessions [2][4].

East Wollega Accessions - Biochemical Ranges

Caffeine 0.8 - 1.6% dry weight
Trigonelline 0.8 - 1.4% dry weight
Total Chlorogenic Acid 4.5 - 6.5% dry weight
Crude Protein 12 - 18% dry weight
Crude Fat 9 - 15% dry weight

*Ranges based on EIAR study of 101 accessions [2]

Statistical Summary

Trait CV (%) PC1 Loading Significance
Total Chlorogenic Acid 14.2 0.89 P<0.05
Crude Fat 12.8 0.87 P<0.05
Crude Protein 10.5 0.76 P<0.05
Caffeine 9.3 0.68 P<0.05
Trigonelline 8.7 0.65 P<0.05
Dry Matter 1.2 0.12 ns

"The first PC, with Eigenvalue greater than one, alone accounted for 47.9% of the total variation mainly due to the variation in total chlorogenic acid and crude fat content, suggesting that these traits are the major contributors for the observed variability." [2]

Global Coffee Germplasm Collections

Chinese Germplasm Repository of Coffee

Location: Ruili City, Ministry of Agriculture and Rural Affairs

Accessions: 185 C. arabica accessions [6]

Geographic sources: Kenya, Burundi, Côte d'Ivoire, Colombia, Ethiopia, India, Portugal

Genetic groups: Bourbon/Typica, Ethiopian native, Introgression group

Molecular data: RAD-seq available for all accessions (PRJNA1309331)

World Coffee Research

Arabica Varieties Catalog: Comprehensive database of Arabica varieties [4][5]

Features: Genetic descriptions, performance data, adaptation profiles

Ethiopian varieties: Detailed profiles of 74110, 74112, 74158, Geisha, and others

Access Catalog
Trabocca Ethiopian Varieties Guide

Resource: Ethiopian varieties reference guide [10]

Covers: Genetic description, history, cup characteristics of Ethiopian cultivars

Complement to: World Coffee Research catalog

Access Guide

International Collections with Ethiopian Material

Institution Location Ethiopian Accessions Notes
CATIE Costa Rica ~200 International coffee collection
IRD (France) Reunion/France FAO/ORSTOM survey material Historical collections from 1960s [8]
Chinese Germplasm Repository Ruili, China Ethiopian group included RAD-seq data available [6]

Coffee Production Systems Database

Regional distribution of coffee production systems across Ethiopian growing regions [5].

Region Zone/District Semi-Forest (%) Garden (%) Semi-Forest & Garden (%) Plantation (%) Forest (%)
OromiaMana64.7117.6517.650.000.00
OromiaGomma63.6418.1813.644.550.00
OromiaLimu Kosa64.2917.8614.293.570.00
OromiaYayo96.430.000.000.003.57
OromiaHurumu100.000.000.000.000.00
OromiaHaru100.000.000.000.000.00
OromiaNole Kaba100.000.000.000.000.00
OromiaHabro0.00100.000.000.000.00
OromiaDaro Lebu0.00100.000.000.000.00
SNNPRKaffa/Gimbo95.000.000.000.005.00
SNNPRSheka/Yeki66.6720.838.334.170.00
SNNPRAnderacha62.5025.006.256.250.00
SidamaAleta Wendo35.0050.0015.500.000.00
SidamaWondo Genet40.0045.0015.000.000.00
GedeoYirgachefe35.0055.0010.000.000.00
GedeoWenago35.2947.0617.650.000.00

Source: Cell Press Heliyon [5]

Data Access Portals

EIAR Datahub

Ethiopian Institute of Agricultural Research open data repository

Visit
Ethiopian Biodiversity Institute

Field gene bank accession data

Visit
NCBI BioProject

Coffee genomic sequence data

Access PRJNA1309331
World Coffee Research

Arabica varieties catalog

Visit
Trabocca Varieties

Ethiopian varieties reference

Access
Ethiopian Agri Data Hub

Coffee quality and production data

Visit

Key Database Publications

Contribute to Coffee Genetic Databases

Share your research data, accession records, or molecular profiles to build a comprehensive Ethiopian coffee genetic resource.