Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information

Nathaniel Gustafson; Maria Soledad Pera; Yiu Kai Ng

doi:10.1007/978-3-540-69848-7_20

Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information

Nathaniel Gustafson^*, Maria Soledad Pera, Yiu Kai Ng

^*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

3 Citations (Scopus)

Abstract

Tens of thousands of news articles are posted on-line each day, covering topics from politics to science to current events. In order to better cope with this overwhelming volume of information, RSS (news) feeds are used to categorize newly posted articles. Nonetheless, most RSS users must filter through many articles within the same or different RSS feeds in order to locate articles pertaining to their particular interests. Due to the large number of news articles in individual RSS feeds, there is a need for further organizing articles to aid users in locating non-redundant, informative, and related articles of interest quickly. In this paper, we present a novel approach which uses the word-correlation factors in a fuzzy set information retrieval model to (i) filter out redundant news articles from RSS feeds, (ii) shed less-informative articles from the non-redundant ones, and (iii) cluster the remaining informative articles according to the fuzzy equivalence classes generated on the news articles. Our clustering approach requires little overhead or computational costs, and experimental results have shown that it outperforms other existing well-known clustering approaches.

Original language	English
Title of host publication	Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings
Pages	232-247
Number of pages	16
Edition	PART 2
DOIs	https://doi.org/10.1007/978-3-540-69848-7_20
Publication status	Published - 2008
Externally published	Yes
Event	International Conference on Computational Science and Its Applications, ICCSA 2008 - Perugia, Italy Duration: 30 Jun 2008 → 3 Jul 2008

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number	PART 2
Volume	5073 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	International Conference on Computational Science and Its Applications, ICCSA 2008
Country/Territory	Italy
City	Perugia
Period	30/06/08 → 3/07/08

Access to Document

10.1007/978-3-540-69848-7_20

Cite this

Gustafson, N., Pera, M. S., & Ng, Y. K. (2008). Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information. In Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings (PART 2 ed., pp. 232-247). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5073 LNCS, No. PART 2). https://doi.org/10.1007/978-3-540-69848-7_20

Gustafson, Nathaniel ; Pera, Maria Soledad ; Ng, Yiu Kai. / Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information. Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings. PART 2. ed. 2008. pp. 232-247 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2).

@inproceedings{7e549a6b99ff408aa40c4acd4518f2c3,

title = "Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information",

abstract = "Tens of thousands of news articles are posted on-line each day, covering topics from politics to science to current events. In order to better cope with this overwhelming volume of information, RSS (news) feeds are used to categorize newly posted articles. Nonetheless, most RSS users must filter through many articles within the same or different RSS feeds in order to locate articles pertaining to their particular interests. Due to the large number of news articles in individual RSS feeds, there is a need for further organizing articles to aid users in locating non-redundant, informative, and related articles of interest quickly. In this paper, we present a novel approach which uses the word-correlation factors in a fuzzy set information retrieval model to (i) filter out redundant news articles from RSS feeds, (ii) shed less-informative articles from the non-redundant ones, and (iii) cluster the remaining informative articles according to the fuzzy equivalence classes generated on the news articles. Our clustering approach requires little overhead or computational costs, and experimental results have shown that it outperforms other existing well-known clustering approaches.",

author = "Nathaniel Gustafson and Pera, {Maria Soledad} and Ng, {Yiu Kai}",

year = "2008",

doi = "10.1007/978-3-540-69848-7_20",

language = "English",

isbn = "354069840X",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

number = "PART 2",

pages = "232--247",

booktitle = "Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings",

edition = "PART 2",

note = "International Conference on Computational Science and Its Applications, ICCSA 2008 ; Conference date: 30-06-2008 Through 03-07-2008",

}

Gustafson, N, Pera, MS & Ng, YK 2008, Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information. in Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings. PART 2 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 2, vol. 5073 LNCS, pp. 232-247, International Conference on Computational Science and Its Applications, ICCSA 2008, Perugia, Italy, 30/06/08. https://doi.org/10.1007/978-3-540-69848-7_20

Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information. / Gustafson, Nathaniel; Pera, Maria Soledad; Ng, Yiu Kai.
Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings. PART 2. ed. 2008. p. 232-247 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5073 LNCS, No. PART 2).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information

AU - Gustafson, Nathaniel

AU - Pera, Maria Soledad

AU - Ng, Yiu Kai

PY - 2008

Y1 - 2008

N2 - Tens of thousands of news articles are posted on-line each day, covering topics from politics to science to current events. In order to better cope with this overwhelming volume of information, RSS (news) feeds are used to categorize newly posted articles. Nonetheless, most RSS users must filter through many articles within the same or different RSS feeds in order to locate articles pertaining to their particular interests. Due to the large number of news articles in individual RSS feeds, there is a need for further organizing articles to aid users in locating non-redundant, informative, and related articles of interest quickly. In this paper, we present a novel approach which uses the word-correlation factors in a fuzzy set information retrieval model to (i) filter out redundant news articles from RSS feeds, (ii) shed less-informative articles from the non-redundant ones, and (iii) cluster the remaining informative articles according to the fuzzy equivalence classes generated on the news articles. Our clustering approach requires little overhead or computational costs, and experimental results have shown that it outperforms other existing well-known clustering approaches.

AB - Tens of thousands of news articles are posted on-line each day, covering topics from politics to science to current events. In order to better cope with this overwhelming volume of information, RSS (news) feeds are used to categorize newly posted articles. Nonetheless, most RSS users must filter through many articles within the same or different RSS feeds in order to locate articles pertaining to their particular interests. Due to the large number of news articles in individual RSS feeds, there is a need for further organizing articles to aid users in locating non-redundant, informative, and related articles of interest quickly. In this paper, we present a novel approach which uses the word-correlation factors in a fuzzy set information retrieval model to (i) filter out redundant news articles from RSS feeds, (ii) shed less-informative articles from the non-redundant ones, and (iii) cluster the remaining informative articles according to the fuzzy equivalence classes generated on the news articles. Our clustering approach requires little overhead or computational costs, and experimental results have shown that it outperforms other existing well-known clustering approaches.

UR - http://www.scopus.com/inward/record.url?scp=54249123788&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-69848-7_20

DO - 10.1007/978-3-540-69848-7_20

M3 - Conference contribution

AN - SCOPUS:54249123788

SN - 354069840X

SN - 9783540698401

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 232

EP - 247

BT - Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings

T2 - International Conference on Computational Science and Its Applications, ICCSA 2008

Y2 - 30 June 2008 through 3 July 2008

ER -

Gustafson N, Pera MS, Ng YK. Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information. In Computational Science and Its Applications - ICCSA 2008 - International Conference, Proceedings. PART 2 ed. 2008. p. 232-247. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2). doi: 10.1007/978-3-540-69848-7_20

Generating fuzzy equivalence classes on RSS news articles for retrieving correlated information

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this