Belirteç seçiminin Huffman kodlaması üzerine etkisi

Please use this identifier to cite or link to this item: http://hdl.handle.net/11607/3048

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Günel, Korhan	-
dc.contributor.author	Dincel, Onur	-
dc.date.accessioned	2017-07-19T13:05:59Z	-
dc.date.available	2017-07-19T13:05:59Z	-
dc.date.issued	2016	-
dc.date.submitted	2016	-
dc.identifier.uri	http://hdl.handle.net/11607/3048	-
dc.description.abstract	Bu çalışmada, belirteç seçiminin istatistiksel veri sıkıştırma yöntemlerinden biri olan Huffman sıkıştırma algoritması üzerine etkisi ve verimliliği araştırılmıştır. Bu amaçla Huffman ağacı üretebilmek için düzgün deyimler kullanılarak tanımlanan farklı türdeki belirteçlerin sıkıştırmada sağladığı kazanç hesaplanmış ve sıkıştırma performansları karşılaştırılmıştır. Çalışma beş ana bölümden oluşmaktadır. Giriş bölümünde, veri sıkıştırma tanımından ve veri sıkıştırma yöntemlerinin sınıflandırılmasından bahsedilmiştir. İkinci bölümde veri sıkıştırma yöntemlerinden olan istatistiksel veri sıkıştırma incelenmiş ve bilgi teorisi kavramları açıklanmıştır. Çalışmanın üçüncü bölümünde, kullanılan belirteç türlerini açıklama adına n-gram, Türkçe heceleme algoritması ve düzgün deyim kavramlarından söz edilmiştir. Dördüncü bölümde ise n-gram, hece ve düzgün deyimlerin yanı sıra bunların birlikte kullanımları ile yaratılan belirteçler ile Huffman ağaçları oluşturulmuş ve sıkıştırma işlemleri gerçekleştirilmiştir. Sıkıştırma işlemi yedi farklı doküman üzerinde test edilmiştir ve her bir dokümanın kullanılan tüm belirteç türlerine ait sonuçları elde edilmiştir. Çalışmanın son bölümünde elde edilen sonuçlar tartışılmıştır.	tr_TR
dc.description.abstract	In this study, the effect and efficiency of token selection is investigated on the Huffman compression algorithm, one of the statistical data compression methods. To this end, compression gains for different types of tokens identified using regular expressions to produce Huffman tree is calculated and compression performance is compared. The study consists of five main chapters. In the introductory chapter, it is mentioned that the definition of data compression and classification of the data compression methods. In the second chapter, statistical data compression, one of the data compression methods is examined and basic concepts in information theory are explained. In the third chapter of the study, to describe used token type, it is introduced n-gram, Turkish syllabification algorithm and regular expression concept. Also in the fourth chapter, as well as n-gram, syllable and regular expression, Huffman trees with tokens created with collocation of their is generated and compression processing is performed. Compression processing is tested on seven different documents and the results of each document that is used for all tokens type is obtained. In the last chapter of the study, the results obtained is discussed.	tr_TR
dc.language.iso	tur	tr_TR
dc.publisher	Adnan Menderes Üniversitesi, Fen Bilimleri Enstitüsü	tr_TR
dc.rights	info:eu-repo/semantics/embargoedAccess	tr_TR
dc.subject	Veri sıkıştırma	tr_TR
dc.subject	Huffman kodlaması	tr_TR
dc.subject	n-gram	tr_TR
dc.subject	Düzgün deyimler	tr_TR
dc.title	Belirteç seçiminin Huffman kodlaması üzerine etkisi	tr_TR
dc.title.alternative	Effect of token selection on Huffman coding	tr_TR
dc.type	masterThesis	tr_TR
dc.contributor.department	Adnan Menderes Üniversitesi, Fen Bilimleri Enstitüsü, Matematik Anabilim Dalı	tr_TR
Appears in Collections:	Yüksek Lisans

Files in This Item:

File	Description	Size	Format
Onur DİNCEL.pdf	Yüksek Lisans Tezi	1.2 MB	Adobe PDF	View/Open

Show simple item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets