Studying Amount of Attention to Metadata in Scientific-Research Journals Based on the Authors' Instructions and Files Uploaded on the Journals' Websites

Document Type : Original Article


1 Assistant Professor, Department of Knowledge & Information Science, Shahid Chamran University of Ahvaz, Ahvaz, Iran

2 Ph.D. Student, Department of Knowledge & Information Science, Shahid Chamran University of Ahvaz, Ahvaz, Iran


Purpose: Scientific journals, as sources that provide first-class information to the scientific and research community, are expected to present their best to index and store articles accurately and quickly by search engines and databases. Among them, metadata is the best solution. Using metadata in article files can speed up retrieval appropriately in the semantic web environment. Therefore, the amount of information in the hidden web environment will be reduced, and as a result, the access and citation of articles will increase, and this will lead to the acceleration and improvement of the science dissemination process. Therefore, the current research aims to investigate the use of metadata in scientific journal articles inside and outside the country based on the authors' instructions and files uploaded in three formats: Word, PDF, and HTML.
Method: This research is of a descriptive type and was conducted by a survey method. The research community inside the country includes 90 international journals available on the journal ranking site of the Ministry of Science, Research, and Information Technology. For checking the conditions of entering the research, 50 journals were examined if they have an active page and link. Abroad, the multiplicity and dispersion of the subject created a heterogeneous society for investigation. Therefore, to solve this problem, the journals of three major publishers of scientific journals in different subject areas, namely Wiley, Elsevier, and Springer were selected and included in the study, each of which publishes the number of 2756, 2937, and 3750 journals respectively. The aforementioned publishers had the same publication structure and authors' guidelines for all journals, which made it easier to collect data and homogenize the community. The data collection tool is also a researcher-made checklist with 41 items (questions) in three parts to check Word, PDF files, and HTML. Each of the questions was extracted according to the standards
raised concerning metadata on the website of the World Wide Web Consortium (W3C). The face validity of the checklist was confirmed by a group of professors of the Department of Information Science and Epistemology, and its reliability was confirmed with Cronbach's alpha coefficient
of 0.74.
Findings: Among domestic journals, there are no instructions in the authors' guide for preparing a Word file, and among foreign publishers, only Springer has paid attention to this issue. About the HTML file. Also, none of the internal journals have HTML files, but foreign publishers have paid attention to this issue and have acted according to metadata standards. Also, only a limited number of journals have taken into consideration the metadata requirements that can be implemented by the authors and required the authors to comply with them. The result of the significant analysis of the difference between domestic and foreign journals in components of Word, PDF, and HTML showed the significant value of the Kruskal-Wallis test for PDF files and HTML was less than 0.05 (sig≤0.05), so it can be said that the observed difference regarding the level of metadata compliance in PDF files and H.T.M.L., among domestic and foreign journals, is meaningful; But regarding Word files, the observed difference is not significant and the findings are at the same level.
Conclusion: Measuring the difference between domestic and foreign magazines in the components mentioned in three Word, PDF files, and H.T.M.L. is meaningful. Domestic journals (those that are published internationally and were the target of study in this research) do not pay any attention to these components mentioned in the three files of Word, PDF. and did not have HTML. Only 9 journals published by Springer Publications, because this publisher has a general instruction for all authors, have paid attention to the issue of metadata in Word file setting and include it in PDF files and H.T.M.L. of articles. In general, the attention to metadata and its use among domestic journals is almost zero, and it is necessary to pay serious attention to this issue in order to improve the quality of indexing and access to articles.


Main Subjects

Arlitsch, K. (2017). Semantic web identity of academic libraries. Journal of Library Administration, 57(3): 346-358.
Baradar, R. & Najafinia, Sh. (2008). Evaluating the websites of continuous electronic magazines for children and adolescents in Iran. Iranian Journal of Information Processing & Management Quarterly, 23(54): 1-34. [in persian]
Craven, T.C. (2001). Description meta-tags in locally linked Web pages. Aslib Proceedings, 53(6): 203-16.
Dadkhah, M. & Kahani, M. (2016). Information management using semantic web technology. Rahyaft, No. 68. [in persian]
Fenner, M. (2015). Metadata in Microsoft Word documents. URL: 
Gilvari, A. (2004). The design of standard compilation of Persian electronic magazines. PhD. Thesis. Library and Information Department, Faculty of Literature and Human Sciences, Shiraz University. [in persian]
Henshaw, R. & Valauskas, E.J. (2001). Metadata as a catalyst: experiments with metadata and search engines in the Internet journal, First Monday. Libri, 51(2): 86-101.
Henshaw, R. (1999). The First Monday metadata project. Libri, 49(3): 125-31.
Jamali Mehmoi, H. (2003). Semantic web: an evolving method. Informatics, 1(2): 49-66. [in persian]
Kohler, J., Philippi, S., Specht, M. & Ruegg, A. (2006). Ontology based text indexing and querying for the semantic web. Knowl. Based Syst., 19: 744-754.
Larry, E.D. & Lars, E.D. (2012). Digital Forensics for Legal Professionals: Understanding Digital Evidence from The Warrant to The Courtroom. Chapter 27 - Metadata, Syngress: 179-186.
Levering, R. & Cutler, M. (2006). The portrait of a common HTML web page. In: Proceedings of the 2006 ACM symposium on Document engineering (pp. 198-204).
Mahdipour, A. & Hashemzadeh, M. (2011). Information architecture in full-text scientific-research electronic journals in the field of humanities. Library and information Science, 16(58): 35-60. [in persian]
McQuilton, P., Batista, D., Beyan, O., Granell, R., Coles, S., Izzo, M. & et al. (2020). Helping the consumers and producers of standards, repositories and policies to enable FAIR Data. Data Intelligence, 2(1-2): 151-157.‏
Rarius, P. (2018). What is Metadata in a PDF File?
Singh, G. & Jain, V. (2014). Information retrieval (IR) through semantic web (SW): an overview. arXiv preprint arXiv: 1403.7162.‏
Sotoudeh, H., Razmjo, F. & Zare, L. (2009). Evaluation of how to introduce Iranian scientific-research journals on the Internet based on international standards: emphasizing the features that affect the visibility of scientific journals. Library and information Science, 12(48): 22-205. [in persian]
Sullivan, D. (2007). How to use HTML meta tags. Search Engine Watch, No. 5.‏