Generating Efficient Rules for Associative Classification

Please use this identifier to cite or link to this item: http://202.28.34.124/dspace/handle123456789/876

Title:	Generating Efficient Rules for Associative Classification การสร้างกฎที่มีประสิทธิภาพสำหรับการจำแนกเชิงความสัมพันธ์
Authors:	Chartwut Thanajiranthorn ชาติวุฒิ ธนาจิรันธร Panida Songram พนิดา ทรงรัมย์ Mahasarakham University. The Faculty of Informatics
Keywords:	การจำแนกเชิงความสัมพันธ์ กฎความสัมพันธ์ระบุคลาส การแสดงข้อมูลแนวตั้ง การจำแนกข้อมูล Associative Classification Class Association Rule Vertical Data Representation Classification
Issue Date:	31
Publisher:	Mahasarakham University
Abstract:	Associative classification is a classification technique that combines classification and association rule mining for classifying unseen data. In the literature, associative classification technique has been found to be more accurate than traditional classification techniques and gives classifier that is easy to interpret by utilizing association rules. However, if a low minimum support threshold is given, a large number of frequent ruleitems will be generated. Some of the ruleitems are not used for classification and needed to be pruned. Moreover, computation time and memory are massively consumed. These problems are highly intensive especially when an input dataset has a large number of dimensions. In this paper, a new associative classification algorithm is proposed to eliminate unnecessary ruleitems. It directly discovers efficient rules for classification. A vertical data representation technique is implemented to avoid unnecessary ruleitems and speeds up mining processes. The experimental results show that the proposed algorithm archives in terms of accuracy, a number of generated ruleitems, classifier building time, and memory consumption, when comparing to the well-know algorithms, CBA, CMAR, and FACA. การจำแนกเชิงความสัมพันธ์เป็นเทคนิคการจำแนกชุดข้อมูลที่รวมการจำแนกและกฎความสัมพันธ์เข้าด้วยกัน จากการวิจัยที่ผ่านมาพบว่าการจำแนกเชิงความสัมพันธ์สามารถจำแนกข้อมูลได้ถูกต้องมากกว่าเทคนิคจำแนกแบบดั้งเดิมและให้แบบจำลองที่ง่ายต่อการแปลความหมายเพราะอยู่ในรูปบบของกฎความสัมพันธ์ อย่างไรก็ตามการจำแนกเชิงความสัมพันธ์เผชิญกับปัญหาการสร้างกฎรายการจำนวนมากเมื่อค่าสนับสนุนขั้นต่ำถูกกำหนดให้มีค่าน้อย ซึ่งบางกฎไม่ได้ถูกนำมาใช้ในการจำแนก มีความซ้ำซ้อนและต้องถูกกำจัดในภายหลัง ทำให้เวลาในการประมวลผลเพิ่มขึ้นและการใช้หน่วยความจำปริมาณมากสำหรับการสร้างแบบจำลอง ปัญหาเหล่านี้แปรผันตรงตามจำนวนชุดข้อมูลที่เพิ่มมากขึ้น งานวิจัยจึงนำเสนอขั้นตอนวิธีใหม่สำหรับการจำแนกเชิงความสัมพันธ์เพื่อกำจัดกฎรายการที่ไม่จำเป็น โดยมุ่งค้นหาเฉพาะกฎที่มีประสิทธิภาพสำหรับการจำแนก การแทนค่าข้อมูลแนวตั้งได้ถูกนำเข้ามาใ่ช้เพื่อหลีกเลี่ยงกฎรายการที่ไม่จำเป็นและลดเวลาในกระบวนการขุดค้นข้อมูล ผลการทดลองแสดงว่าขั้นตอนวิธีที่นำเสนอมีประสิทธิภาพทางด้านความถูกต้องในการจำแนกข้อมูล เวลาและหน่วยความจำในการประมวลผลเมื่อเปรียบเทียบกับขั้นตอนวิธี CBA CMAR และ FACA
Description:	Doctor of Philosophy (Ph.D.) ปรัชญาดุษฎีบัณฑิต (ปร.ด.)
URI:	http://202.28.34.124/dspace/handle123456789/876
Appears in Collections:	The Faculty of Informatics

Files in This Item:

File	Description	Size	Format
60011260501.pdf		2.89 MB	Adobe PDF	View/Open

Show full item record