Web Search Engines to the Markup Metadata Records of Person Entity (The Fourteen Infallibles) Based on Schema.org

Document Type : Original Article

Authors

1 Associate Professor, Department of Knowledge and Information Science, University of Qom, Qom, Iran

2 Assistant Professor, Department of Knowledge and Information Science, University of Isfahan, Isfahan, Iran

3 M.A., Department of Knowledge and Information Science, University of Qom, Qom, Iran

Abstract

Objectives: The present study aims to survey the reaction of Web search engines to the markup metadata records of a person entity (fourteen infallibles) based on Schema.org at two levels of indexability and semantic visibility.
Methods: The research method is experimental. The research populations consisted of 42 metadata records in the form of two experimental groups (14 records in Microdata format and 14 records in JSON-LD format) and a control group (14 records in HTML format). Another research population is Web search engines (Google and Bing) which was selected by the targeted sampling method. These records were published on an independent website and introduced directly to search engines. The data collection method was structured observation and the data collection tool was researcher-made checklists.
Results: The results showed that Google and Bing search engines indexed the metadata records of person entities in two experimental groups (Microdata and JSON-LD) and also were done semantic visible. The metadata records of the control groups were also indexed in search engines but were not semantic visibility.
Conclusions: Using Scema.org and its syntactic context for markup to create rich snippets will improve their indexability and semantic visibility in Web search engines. Creating structured data in the Web environment will lead to the realization of the Semantic web, and the retrieval of knowledge.
 

Keywords

Main Subjects


Aghadeh, S. (2018). Designing authority data schema based on Microdata method and study of Search Engines reactions to it. Master’s thesis, Knowledge and Information Science. Allameh Tabataba’i University. [in persian]
Aldaej, A. (2015). An Enhanced Semantic VLE Based on schema.org and Social Media. PhD. Dissertation, University of Surre.
Aldaej, A.A. & Krause, P. (2015). An Enhanced approach to semanitic markup of VLEs content based on Schema.org. Paper presented at the 4th International Workshop on Learning and Education with the Web of Data co-located with 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy. Available at:  
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.666.2801&rep=rep1&type=pdf (accessed 1 February 2020).
Babalhavaeji, F., Taheri, S.M. & Agha Abedi, Z. (2015). The Effect of syntax on the indexing & ranking of Metadata Records by the Web Search Engine: a Comparative Study on MARCXML and DCXML Metadata Records. Journal of Knowledge Retrieval and Semantic Systems, 1(3): 43-59. [in persian]
Fardehosseini, M., Taheri. M., Hariri, N., Babalhavaeji, F. & Nooshinfard, F. (2020). Representing Properties and Relationships between Entities of Creative Works in Schema.org Based on Library Reference Model (LRM). Iranian Journal of Information Processing and Management, 36(2): 533-562. [in persian]
Friedrich, C. (2015). What search engines can't Do: Holistic entity search on web data. PhD. Dissertation. Technische Universitat Branunschweig, Braunschweig, Germany.
Hawskey, M., Barker, P. & Compbell, L.M. (2013). New approaches to describing and discovering open educational resources. In: Proceedings of OER 13: Creating a Virtuous Circle. Nottingham, England. Retrieved August 19, 2019 from:
http://publications.cetis.org.uk/wp-content/uploads/2013/04/OER13_resourcediscovery.pdf.
Jalili Manaf, M. (2021). Comparative Study of Indexability and Semantic Visibility of Metadata Records of the Type of Thesis Based on the Method of Rich Snippets in Public Web Search Engines. Master’s thesis, Knowledge and Information Science, University of Qom. [in persian]
Mika, P. (2015). On schema. org and why it matters for the web. IEEE Internet Computing, 19(4): 52-55.
Mixter, J., Obrien, P. & Arlitsch, K. (2014). Describing Theses and Dissertations using Schema.org. In: Proceedings of the International Conference on Dublin Core and Metadata Applications, October 8-11, Austin, TX.
Mohammadi Ostani, M. (2019). Designing microdata schema for Iranian-Islamic information context's manuscripts and studying the reaction of web search engines to the records based on that schema. Phd. Dissertation, Knowledge and Information Science, University of Isfahan. [in persian]
Mohammadi Ostani, M., Cheshmesohrabi, M., Shabani, A., Asemi, A. & Taheri, M. (2019). Methodology Explanation of Schema.org and Analysis of its Approach to the Processing and Organization of Web Content Objects. Iranian Journal of Information Processing and Management, 34(4): 1767-1798. [in persian]
Paulheim, H. (2015). What the Adoption of schema.org Tells About Linked Open Data. In: The 2nd International Workshop on Dataset PROFiling & Federated Search for Linked Data.
Pitchers, J. (2014). The Difference Between Microdata, Structured Data, Rich Snippets And Schema. Retrieved from:         
https://joomstore.com.au/blog/the-difference-between-microdata-structured-data-rich-snippets-and-schema.html
Razaviyeh, A. (2011). Research Methods in Behavioral and Education Sciences. Shiraz: Shiraz University. [in persian]
Ronallo, J. (2012). HTML5 Microdata and Schema. org. Code4Lib Journal, 16.
Safari, M. (2005). Search Engine and Resource Discovery on the Web: Is Dublin Core an Impact Factor. Retrieved from: http://www.webology.ir/2005/v2n2/a13.html
Shafi’ie Alavijeh, S., Ghaebi, A. & Rezaie Sharifabadi, S. (2009). Review of Metadata Elements within the Web Pages Resulting from Searching in General Search Engines. Iranian Journal of Information Processing and Management, 25(1): 71-89. [in persian]
Sharif, A. (2007). Investigating the Effectiveness of Metadata Elements on Ranking Web Pages by Public Search Engines. Library and Information Science, 10 (2): 241-258. [in persian]
Simsek, U., Karle, E. & Fensel, D. (2018). Machine Readable Web APIs with Schema.org Action Annotations. Procedia Computer Science, 137:255-261. Available at:  
https://arxiv.org/abs/1805.05479.(accessed 28 March 2020)
Şimşek, U., Kärle, E., Holzknecht, O. & Fensel, D. (2017). Domain specific semantic validation of schema.org annotations. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 10742 LNCS: 417-429. DOI: https://doi.org/10.1007/978-3-319-74313-4_31
Taheri, M., Nikzad Bahle, R. & Samiee, M. (2018). Study on Search Engines’ Reaction to the Metadata Records Created Based on Combined Method of Rich snippets and Linked Data. Iranian Journal of Information Processing and Management, 33(2): 658-639. [in persian]
Taheri, S.M. & Khosrowjerdi, M. (2018). The effect of syntax on interoperability among metadata standards: Another step towards integrating information systems. Library Philosophy and Practice. Retrieved from:
http://ezproxy.reinhardt.edu:2048/login?url=https://search-proquest com.ezproxy.reinhardt.edu:2043/docview/2164507562?accountid=13483
Tort, A. & Olive, A. (2015). An approach to website Schema.org design. Data & Knowledge Engineering, 99: 3-16.
Wallis, R., Isaac, A., Charles, V. & Manguinhas, H. (2017). Recommendations for the application of Schema.org to aggregated Cultural Heritage metadata to increase relevance and visibility to search engines: the case of Europeana. Code4Lib Journal, 36. Available at:  
http://journal.code4lib.org/articles/12330#appendix1 (accessed 30 March 2020)
Zolghadr, S. (2016). Comparative Study of Indexing and Finding of Microdata Method in Web Search Engines. Master’s thesis, Knowledge and Information Science, Science and Reasarch Branch (Tehran), Islamic Azad University. [in persian]
CAPTCHA Image