G-binary: a New Non-Parameterized Code for Improved Inverted File Compression
Dervos, Dimitrios/ Evangelidis, Georgios/ Nitsos, Ilias/ Νίτσος, Ηλίας/ Ευαγγελίδης, Γεώργιος/ Δέρβος, Δημήτριος
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Dervos, Dimitrios | el |
dc.contributor.author | Evangelidis, Georgios | el |
dc.contributor.author | Nitsos, Ilias | el |
dc.contributor.other | Νίτσος, Ηλίας | el |
dc.contributor.other | Ευαγγελίδης, Γεώργιος | el |
dc.contributor.other | Δέρβος, Δημήτριος | el |
dc.date.accessioned | 2015-07-04T18:08:56Z | el |
dc.date.accessioned | 2018-02-27T18:10:21Z | - |
dc.date.available | 2015-07-04T18:08:56Z | el |
dc.date.available | 2018-02-27T18:10:21Z | - |
dc.date.issued | 2003 | el |
dc.identifier | 10.1007/978-3-540-45227-0_46 | el |
dc.identifier | http://link.springer.com/chapter/10.1007%2F978-3-540-45227-0_46 | el |
dc.identifier.citation | Nitsos, I., Evangelidis, G. & Dervos, D. (2003). g-binary: a New Non-Parameterized Code for Improved Inverted File Compression. Lecture Notes in Computer Science. 2736. | el |
dc.identifier.citation | Journal: Lecture Notes in Computer Science, vol. 2736, 2003 | el |
dc.identifier.uri | http://195.251.240.227/jspui/handle/123456789/4344 | - |
dc.description | Faculty publications, School of Management and Economics (ΣΔΟ), Department of Library Science, 2003 | el |
dc.description.abstract | The inverted file is a popular and efficient method for indexing text databases and is widely used in information retrieval applications. As a result, the research literature is rich in models (global and local) that describe and compress inverted file indexes. Global models compress the entire inverted file index using the same method and can be divided into parameterized and non-parameterized ones. The latter use fixed codes and are applicable to dynamic collections of documents. Local models are always parameterized, in the sense that the method they use makes assumptions about the distribution of each word in the document collection of the text database. In the present study, we examine some of the most significant integer compression codes and propose g-binary, a new non-parameterized coding scheme that combines the Golomb codes and the binary representation of integers. The proposed coding scheme does not introduce any extra computational overhead compared to the existing non-parameterized codes. With regard to storage efficiency, experimental runs conducted on a number of TREC text database collections reveal an improvement of about 6% over the existing non-parameterized codes, an improvement that can make a difference for very large text database collections. | el |
dc.language.iso | en | el |
dc.publisher | Springer | el |
dc.rights | This item is probably protected by Copyright Legislation | el |
dc.title | G-binary : a New Non-Parameterized Code for Improved Inverted File Compression | el |
dc.type | Article | el |
heal.type | other | el |
heal.type.en | Other | en |
heal.dateAvailable | 2018-02-27T18:11:21Z | - |
heal.language | el | el |
heal.access | free | el |
heal.recordProvider | TEI of Thessaloniki (ΤΕΙ Θεσσαλονίκης) | el |
heal.fullTextAvailability | false | el |
heal.type.el | Άλλο | el |
Appears in Collections: | Journal Publications |
Files in This Item:
There are no files associated with this item.
Please use this identifier to cite or link to this item:
http://195.251.240.227/jspui/handle/123456789/4344