Filtern
Erscheinungsjahr
Dokumenttyp
- Masterarbeit (18)
- Ausgabe (Heft) zu einer Zeitschrift (15)
- Dissertation (11)
- Studienarbeit (5)
- Bachelorarbeit (3)
- Diplomarbeit (3)
- Habilitation (1)
Schlagworte
- Semantic Web (3)
- ontology (3)
- Linked Open Data (2)
- Maschinelles Lernen (2)
- OWL (2)
- OWL <Informatik> (2)
- Ontology (2)
- RDF <Informatik> (2)
- SPARQL (2)
- mobile phone (2)
- multimedia metadata (2)
- 2019 European Parliament Election (1)
- API (1)
- Algolib (1)
- Analysis of social platform (1)
- Annotation (1)
- Anwendungsintegration (1)
- Articles for Deletion (1)
- Association Rules (1)
- Augenbewegung (1)
- Auslese (1)
- Auswahl (1)
- Belief change, concept contraction, EL (1)
- Bipartiter Graph (1)
- Blickbewegung (1)
- Core Ontology on Multimedia (1)
- Core Ontology on Multimedia (COMM) (1)
- Data manipulation (1)
- Description Logic (1)
- Desktop (1)
- Discussion Forums (1)
- Eclipse <Programmierumgebung> (1)
- Enhanced Representation (1)
- Eye Tracking (1)
- Eyetracking (1)
- Formale Ontologie (1)
- Fotoauswahl (1)
- Function Words (1)
- GReQL2 (1)
- GazeTheWeb (1)
- Generative Model (1)
- Gerichteter Graph (1)
- Handsfree editing (1)
- I-messages (1)
- IT security analysis (1)
- JGraLab (1)
- Kantenbewerteter Graph (1)
- Knowledge Graphs (1)
- Künstliche Intelligenz (1)
- Latent Negative (1)
- Link Prediction (1)
- Linked Data Modeling (1)
- Machine-Learning (1)
- Machinelles lernen (1)
- Metamodel (1)
- MobileFacets System (1)
- Model-Driven Engineering (1)
- Multimedia Metadata Ontology (1)
- Native language identification (1)
- Natural Language Processing (1)
- Netzwerk (1)
- OCL <Programmiersprache> (1)
- OWL-DL (1)
- Online Community (1)
- Ontologie <Wissensverarbeitung> (1)
- Ontologie. Wissensverarbeitung (1)
- Ontology API model (1)
- Ontology alignment (1)
- POIs (1)
- Photographie (1)
- Plug in (1)
- Political Communication (1)
- Reddit (1)
- Regionenlabeling (1)
- Schema Information (1)
- Semantic Data (1)
- Semantic Desktop (1)
- Sesame (1)
- Soziales Netzwerk (1)
- Support System (1)
- Text classification (1)
- Type System (1)
- Type system (1)
- UML (1)
- Unlink Prediction (1)
- Visual Stimuli Discovery (1)
- Vocabulary Mapping (1)
- Vocabulary Reuse (1)
- Web (1)
- Web Science (1)
- Webservice Sail (1)
- Wikipedia (1)
- You-messages (1)
- application programming interfaces (1)
- business process management (1)
- events (1)
- eye tracking (1)
- faceted search (1)
- knowledge work (1)
- metadata formats (1)
- metadata standards (1)
- mobile application (1)
- mobile devices (1)
- mobile facets (1)
- mobile interaction (1)
- mobile phones (1)
- model-driven engineering (1)
- photo selection (1)
- points of interest (1)
- privacy protection (1)
- region labeling (1)
- rich multimedia presentations (1)
- semantic annotation (1)
- semantics (1)
- sensor data (1)
- social media (1)
- social media data (1)
- traffic survey (1)
- visualization (1)
Institut
- Institute for Web Science and Technologies (56) (entfernen)
This Master Thesis is an exploratory research to determine whether it is feasible to construct a subjectivity lexicon using Wikipedia. The key hypothesis is that that all quotes in Wikipedia are subjective and all regular text are objective. The degree of subjectivity of a word, also known as ''Quote Score'' is determined based on the ratio of word frequency in quotations to its frequency outside quotations. The proportion of words in the English Wikipedia which are within quotations is found to be much smaller as compared to those which are not in quotes, resulting in a right-skewed distribution and low mean value of Quote Scores.
The methodology used to generate the subjectivity lexicon from text corpus in English Wikipedia is designed in such a way that it can be scaled and reused to produce similar subjectivity lexica of other languages. This is achieved by abstaining from domain and language-specific methods, apart from using only readily-available English dictionary packages to detect and exclude stopwords and non-English words in the Wikipedia text corpus.
The subjectivity lexicon generated from English Wikipedia is compared against other lexica; namely MPQA and SentiWordNet. It is found that words which are strongly subjective tend to have high Quote Scores in the subjectivity lexicon generated from English Wikipedia. There is a large observable difference between distribution of Quote Scores for words classified as strongly subjective versus distribution of Quote Scores for words classified as weakly subjective and objective. However, weakly subjective and objective words cannot be differentiated clearly based on Quote Score. In addition to that, a questionnaire is commissioned as an exploratory approach to investigate whether subjectivity lexicon generated from Wikipedia could be used to extend the coverage of words of existing lexica.