Flink: Semantic Web technology for the extraction and analysis of social networks

Question

Q1. What contributions have the authors mentioned in the paper "Flink: semantic web technology for the extraction and analysis of social networks" ?

Q2. What future works have the authors mentioned in the paper "Flink: semantic web technology for the extraction and analysis of social networks" ?

Q3. What is the current bottleneck in scalability?

Q4. What is the key idea in the structural approach to social science?

Q5. What is the primary reason why the authors cannot benefit from using HayStack?

Q6. What is the main purpose of the web mining component of Flink?

Q7. What is the main reason why the interface is important?

Q8. What types of knowledge sources is used by Flink?

Q9. What is the alternative source of information from emails?

Q10. What is the danger of a close mapping between the ontology and the run-time?

Q11. What is the disadvantage of rule-based expansion of equivalence?

Q12. What is the purpose of the web mining component?

Q13. What is the way to store data on the scale of millions of triples?

Q14. How can a developer improve the performance of a query?

Q15. Why did the authors increase in importance in the last years?

Q16. What is the trade-off between executing a single large query and a single large?

Accepted Answer

The authors present the Flink system for the extraction, aggregation and visualization of online social networks. The authors demonstrate their novel method to social science based on electronic data using the example of the Semantic Web research community.

Accepted Answer

While technology is important, keeping in touch with social science will be just as important in the future. Creating a social ontology that would allow to classify social relationships along several dimensions is among the future work and so is the finding of patterns for identifying these relationships using electronic data. However, networks themselves may also be the subject of much debate in the future, especially if these sources were originally created for a different purpose, and thus their integration could not have been foreseen. For example, a practical question the authors encountered in their work concerns the multiplexity of social relations: a relationship between two individuals may have a different significance to different areas of social life.

Accepted Answer

In terms of technology, the current bottleneck in scalability is the performance of aggregation (identity reasoning) due to the lack of standard query and rule languages and efficient implementations in RDF stores.

Accepted Answer

A key idea in the structural approach to social science is that the way an actor (an individual or a group) is embedded in a network offers opportunities and imposes constraints on the actor.

Accepted Answer

The uniqueness of presenting social networks is also the primary reason that the authors cannot benefit from using Semantic Web portal generators such as HayStack [5], which are primarily targeted for browsing more traditional object collections.

Accepted Answer

The web mining component of Flink employs a co-occurrence analysis technique first applied to social network extraction in the work of Kautz et al. [14].

Accepted Answer

The authors consider the flexibility of the interface important because there many possibilities to present social networks to the user and the best way of presentation may depend on the size of the community as well as other factors.

Accepted Answer

Flink uses four different types of knowledge sources: HTML pages from the web, FOAF profiles from the Semantic Web, public collections of emails and bibliographic data.

Accepted Answer

An alternative source of bibliographic information (used in previous versions of the system) is the Bibster peer-to-peer network [9], from which metadata can be exported directly in the SWRC ontology format.

Accepted Answer

The danger of a close mapping between the ontology and the run-time model is that the application needs to be rewritten whenever the underlying ontology changes.

Accepted Answer

The rule-based expansion of equivalence has the disadvantage that it requires the storage of the same information about all the equivalent instances.

Accepted Answer

The web mining component also performs the additional task of finding topic interests, i.e. associating researchers with certain areas of research.

Accepted Answer

From a scalability perspective, the authors are glad to note that the Sesame server offers very high performance in storing data on the scale of millions of triples, especially using native repositories.

Accepted Answer

In many cases, the developer himself can improve the performance of a query by rewriting it manually, e.g. by reordering the terms or breaking the query in two.

Accepted Answer

Their social connectivity might have even increased in importance in the last years simply by the virtue of the information overload the authors are facing.

Accepted Answer

The trade-off is in terms of memory footprint versus communication overhead: small, targeted queries are inefficient due to the communication and parsing involved, while large queries produce large result sets that need to be further processed on the client side.

Flink: Semantic Web technology for the extraction and analysis of social networks

Figures

Citations

The Semantic Web Revisited

A Framework for Web Science

Social Networks and the Semantic Web

POLYPHONET: An advanced social network extraction system from the Web

Different Aspects of Social Network Analysis

References

Social Network Analysis: Methods and Applications

Social Network Analysis: Methods and Applications.

Social Network Analysis: A Handbook

Networks of scientific papers.

Linked: The New Science of Networks

Related Papers (5)