Escherichia coli is a ubiquitous bacterial species, associated with drug resistant infections; hundreds of thousands of genomes are now available, but are biased towards high-income countries and clinical isolates. Data from sub-Saharan Africa (sSA) are underrepresented in global sequencing efforts and may represent a major source of genetic diversity with respect to transmissible antimicrobial resistance (AMR). We carried out a genomic investigation of extended-spectrum beta-lactamase (ESBL)-producing E. coli colonising adults in Blantyre, Malawi to assess the diversity and AMR determinants and to place these isolates in the context of globally available genomes. We carried out short-read whole-genome sequencing of 473 colonising ESBL E. coli isolated from stool and placed them in the context of a previous curated species wide collection of 10,146 isolates using the popPUNK clustering algorithm and by constructing a core gene phylogeny. The most frequently identified STs in Malawian isolates were the globally successful ST131 and ST410, and blaCTX-M were the dominant ESBL genes, mirroring global trends. However, 37% of Malawian isolates did not cluster with any isolates in the global collection, and the core gene phylogeny was consistent with local subclades including in ST410 and several phylogroup A lineages. Apparent undescribed diversity in Malawian E. coli could be due to local selection pressures or sampling biases in global E. coli collections. Taking a one health approach to further sampling of E. coli from Malawi and sSA, and principled incorporation into unbiased global collections is necessary to understand local, regional and global transmission of both E. coli and priority AMR genes.
This is a companion discussion topic for the original entry at https://doi.org/10.1101/2021.10.07.463523