Implementasi POS-Tag dan Bilingual Dataset untuk Peningkatan Performa Model Named Entity Recognition Berbasis Bi-LSTM+CRF

Cornelius, Kenny (2022) Implementasi POS-Tag dan Bilingual Dataset untuk Peningkatan Performa Model Named Entity Recognition Berbasis Bi-LSTM+CRF. Bachelor Thesis, Universitas Multimedia Nusantara.

Text
HALAMAN_AWAL.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (1MB)

Text
DAFTAR_PUSTAKA.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (604kB)

Preview

Text
BAB_I.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (945kB) | Preview

Preview

Text
BAB_II.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (1MB) | Preview

Text
BAB_III.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (1MB)

Text
BAB_IV.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (6MB)

Text
BAB_V.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (312kB)

Text
LAMPIRAN.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (4MB)

Abstract

Dewasa ini, pertukaran informasi berlangsung dengan sangat cepat dan mudah de- ngan bantuan internet. Namun, muncul sebuah tantangan dimana informasi yang beredar di internet bersifat tidak terstruktur. Terlebih lagi, informasi-informasi tersebut dapat tertulis dalam berbagai macam bahasa. Oleh karena itu dikem- bangkanlah sebuah sistem yang NER untuk mengolah informasi tersebut. Pada penelitian ini, sistem NER yang dibangun mengaplikasikan pelatihan menggunakan dua dataset dengan bahasa yang berbeda (Bahasa Indonesia dan Bahasa Ingrris) dan POS-Tagging. Penelitian ini bertujuan untuk membandingkan performa model yang dilatih menggunakan dataset dengan satu bahasa dan dua bahasa, serta memband- ingkan performa model dengan atau tanpa penggunaan POS-Tagging. Berdasarkan beberapa skenario percobaan, performa model yang memiliki performa paling baik adalah model yang dilatih menggunakan dataset Bahasa indonesia dan tanpa peng- gunaan POS-Tagging, yang memiliki akurasi sebesar 95%. Penelitian ini juga menyimpulkan bahwa penggunaan bilingual dataset memiliki perbedaan akurasi sebesar 5% (tanpa POS-Tag) dan 12% (dengan POS-Tag). Selain itu penggunaan POS-Tag pada model membuat performa model menurun sebesar 25% pada model yang dilatih menggunakan dataset Bahasa Indonesia dan 8% pada model yang di- latih menggunakan dataset Bahasa Indonesia dan Bahasa Inggris.

Item Type:	Thesis (Bachelor Thesis)
Creators:	Cornelius, Kenny (00000019757)
Contributors:	Christian Young, Julio Suryadibrata, Alethea
Keywords:	Bidirectional LSTM, Bilingual, Conditional Random Fields, Named Entity Recognition, POS-Tagging, prediksi
Subjects:	000 Computer Science, Information and General Works > 000 Computer Science, Knowledge and Systems > 004 Computer Science, Data Processing, Hardware 000 Computer Science, Information and General Works > 000 Computer Science, Knowledge and Systems > 006 Special Computer Methods > 006.2 Special-purpose System, Data Collection, Automatic Identification and Data Capture 000 Computer Science, Information and General Works > 000 Computer Science, Knowledge and Systems > 006 Special Computer Methods > Artificial Intelligence, Machine Learning, Pattern Recognition, Data Mining
Sustainable Development Goals:	Goal 04. Ensure inclusive and equitable quality education and promote lifelong learning Goal 08. Promote sustained, inclusive and sustainable economic growth, full and productive employment and work for all Goal 09. Build resilient infrastructure, promote inclusive and sustainable industrialization and foster innovation
Divisions:	Faculty of Engineering & Informatics > Informatics
Date Deposited:	18 Nov 2022 07:52
URI:	https://kc.umn.ac.id/id/eprint/19945

Actions (login required)

View Item

This repository is indexed on

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.