Advertisement

If you have an ACS member number, please enter it here so we can link this account to your membership. (optional)

ACS values your privacy. By submitting your information, you are gaining access to C&EN and subscribing to our weekly newsletter. We use the information you provide to make your reading experience better, and we will never sell your data to third party members.

ENJOY UNLIMITED ACCES TO C&EN

Big Data

AI tool aggregates microplastics across studies

Mounds of mismatched plastics data get an app for rapid comparison

by Louisa Dalton, special to C&EN
November 25, 2024

 

A gloved hand picks a millimeters-sized flake of plastic with tweezers out of a pile of similarly sized plastic flakes in assorted colors.
Credit: Shutterstock

As microplastics have multiplied, so have the studies quantifying them. And across all those studies, plastics researchers use thousands of different categories for materials, shapes, and size ranges, which means those mounting data can’t be easily combined to show a larger picture.

In terms of data compatibility, “it’s been the Wild West for a long time,” says Win Cowger, research director at Moore Institute for Plastic Pollution Research. “We need a tool to wrangle everything together.”

Cowger joined forces with environmental science graduate student Hannah Hapich and hydrologist Andrew Gray at the University of California, Riverside, to create such a tool (Environ. Sci. Technol. 2024, DOI: 10.1021/acs.est.4c02406). Hapich saw how the growth in natural language processing (NLP) technology, computing power, and open-source artificial intelligence software could automate data harmonization in a way that wasn’t possible even a year ago.

The team’s AI-powered model sorts material and shape categories into hierarchical nested bins. It incorporates more than 50 reference and conversion databases to convert information on microplastics’ length, material, and shape to their volume, mass, and density. Hapich has posted the open-source app and dubbed it MaTCH (Microplastics and Trash Cleaning and Harmonization).

The software, its creators found, can harmonize otherwise incomparable datasets (such as river and drinking water studies) with differing formats, measurements, and particle descriptions. Its NLP technology assigns unfamiliar categories to correct bins 71–94% of the time—which is high for NLP models, Hapich says. “The tool exponentially speeds up the process.”

The tool outputs sunburst plots that organize data hierarchically, which make clear how the tool categorized and sorted the input datasets. “The whole idea is to maximize comparability without flattening the data and destroying specificity,” Hapich says.

California recently became the first government in the world to require testing for microplastics in drinking water, and Cowger wants to make it standard practice to run drinking water analyses through MaTCH. “It really just makes the data extremely useful and rapidly comparable. This is how everybody should do it,” he says.

Advertisement

Article:

This article has been sent to the following recipient:

0 /1 FREE ARTICLES LEFT THIS MONTH Remaining
Chemistry matters. Join us to get the news you need.