Waste Pollution Classification in Indonesian Language using DistilBERT

Bambang Nursandi; Abba Suganda Girsang

doi:10.31943/gw.v15i1.645

Authors

Bambang Nursandi, Universitas Bina Nusantara, Indonesia
Abba Suganda Girsang, Universitas Bina Nusantara, Indonesia

DOI:

https://doi.org/10.31943/gw.v15i1.645

Keywords:

DistilBERT, Text Classification, Natural Language Processing, Data Mining

Abstract

In Indonesia, waste pollution poses pressing environmental and health challenges, making accurate classification vital for targeted mitigation efforts. DistilBERT is a streamlined counterpart to the BERT architecture, designed to retain much of BERT's language understanding at a reduced computational cost. Through transfer learning, DistilBERT benefits from knowledge acquired during pretraining on large text corpora, which makes it well suited to scenarios where labeled data is limited. In our research, we adopted DistilBERT to classify waste types using a small dataset derived from Twitter conversations in the Indonesian language, a medium known for its concise and often ambiguous content. Despite the dataset's restricted scope and the noise inherent to Twitter, DistilBERT performed strongly, reaching a precision of 98%. This outcome shows DistilBERT's capability to discern complex textual nuances even in data-restricted environments, and it further highlights the significance of transfer learning in contemporary natural language processing, especially in contexts as critical as Indonesia's waste management efforts.
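The abstract reports precision as its headline metric. As a reminder of what that figure measures, the following is a minimal sketch of per-class and macro-averaged precision (true positives divided by all predictions of a class). The label names and the toy predictions are hypothetical and are not taken from the paper's dataset; the abstract also does not say which averaging scheme the authors used, so macro averaging here is an assumption.

```python
from collections import Counter


def per_class_precision(y_true, y_pred):
    """Return {label: precision}, where precision = TP / (TP + FP).

    Labels that are never predicted are left out of the result.
    """
    true_pos = Counter()   # correct predictions per predicted label
    predicted = Counter()  # total predictions per label (TP + FP)
    for truth, pred in zip(y_true, y_pred):
        predicted[pred] += 1
        if truth == pred:
            true_pos[pred] += 1
    return {label: true_pos[label] / predicted[label] for label in predicted}


def macro_precision(y_true, y_pred):
    """Unweighted mean of the per-class precision values."""
    scores = per_class_precision(y_true, y_pred)
    return sum(scores.values()) / len(scores)


# Hypothetical waste-type labels, for illustration only.
y_true = ["organic", "plastic", "plastic", "organic", "other", "plastic"]
y_pred = ["organic", "plastic", "organic", "organic", "other", "plastic"]

scores = per_class_precision(y_true, y_pred)
macro = macro_precision(y_true, y_pred)
```

In this toy example one "plastic" tweet is misclassified as "organic", so the precision for "organic" drops to 2/3 while the other two classes stay at 1.0. Precision alone does not show how many items of a class were missed, so it is usually read alongside recall.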

Published

2024-04-04

How to Cite

Nursandi, B., & Girsang, A. S. (2024). Waste Pollution Classification in Indonesian Language using DistilBERT. Gema Wiralodra, 15(1), 404–413. https://doi.org/10.31943/gw.v15i1.645

Issue

Vol. 15 No. 1 (2024): Gema Wiralodra

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

The use of non-commercial articles will be governed by the Creative Commons Attribution license as currently approved at http://creativecommons.org/licenses/by/4.0/. This license allows users to (1) Share (copy and redistribute the material in any medium or format) and (2) Adapt (remix, transform, and build upon the material) for any purpose, even commercially.