Masoomeh Moazami; Nadjla Hariri; Atefeh Zarei; Fahimeh Babalhavaeji
Abstract
Aim: The Aim of this study is to analyze the behavior of search and retrieve information by multilingual users in the web environment. Other research objectives are as follows. Methodology: This study is applied in terms of purpose and in terms of method it is a combination study. In the present study, ...
Read More
Aim: The Aim of this study is to analyze the behavior of search and retrieve information by multilingual users in the web environment. Other research objectives are as follows. Methodology: This study is applied in terms of purpose and in terms of method it is a combination study. In the present study, the quantitative part of the research is semi-empirical and in the qualitative part, qualitative content analysis was used to analyze the interviews. In the present study, the researcher first performed three tasks and asked users to search for information in the WorldWideScience search engine and to provide search results. All user behaviors were recorded and evaluated using the software during testing. The study population consisted of Ph.D. students in six fields of sciences (humanities, basic sciences, medical sciences, agricultural sciences, engineering, and arts). In this study, 35 Ph.D. students in six sciences using the web were searched as a sample. The study sample was then divided into two groups: a bilingual user group that is fluent in English and Farsi and the second group of multilingual users fluent in three or more languages. Finally, an analysis was performed on the search and retrieval behavior of users. Then a comparison was made between the information behavior of bilingual users and multilingual users. Results: Qualitative analysis of the nature of information retrieval behavior by multilingual individuals consists of 5 dimensions (defining tasks, formulating retrieval strategies, using translation tools, facing retrieval challenges, and managing (retrieving) retrieval challenges) as well as components.
shahram abdiazar; Nadjla Hariri; fatemeh nooshnfard
Abstract
AbstractBackground and Objectives: The major aim of current research was "to identify the correlation between the state of the philosophic mindedness and the accuracy of their retrieval in selected information databases include Scopus, ProQuest, Science Direct, and Sege and Web of Knowledge Databases. ...
Read More
AbstractBackground and Objectives: The major aim of current research was "to identify the correlation between the state of the philosophic mindedness and the accuracy of their retrieval in selected information databases include Scopus, ProQuest, Science Direct, and Sege and Web of Knowledge Databases. Present research also seeks to answer the following questions:1. What is the average philosophic mindedness of the researchers?2. What is the average of users' retrieval accuracy from the databases under investigation?3. Which of the demographic features of the statistical society has a crucial linkage with the psychological mindedness?4. Which of the demographic features of the statistical society has a crucial linkage with the users’ retrieval accuracy from the databases under investigation?Ultimately, the research hypothesis that "there is a significant correlation between the philosophic subjectivity of the statistical society and their recovery accuracy" will be examined.Methodology: The research method utilized here was semi-experimental. Participants of present survey were 42 researchers in different fields of science (humanities, medical sciences, engineering sciences, arts and basic sciences) in public universities in Tehran, who were chosen on the basis of targeted sampling of the research. In this study, it was endeavored to select factors such as lack of familiarity with search strategies, lack of knowledge of English, and the lack of knowledge of the English language by selecting students to write master's degree dissertation or doctoral theses and autonomous researchers with a master's degree or doctorate in order control the factors so that they will not affect the research outcomes. The Soltani’s philosophic Concept (1996) was utilized to measure the status of the philosophic mindedness of the statistical society. Different studies have been carried out on its validity and reliability. Cronbach's alpha was reported to be 0.91. To elaborate upon the outcomes received from the research data, the mean and standard deviations were deployed for data analysis and Chi-square, Kolmogorov-Smirnov, ANOVA, T-test and Pearson correlation were used to analyze the relationship between variables. Findings: The analysis of the characteristics of the philosophic mindedness of the statistical society is indicative that the category of "good and pleasant philosophic mindedness" is most frequent with 21 people, as well as the category of "philosophic mindedness" with three (7.1%) was the least frequent among the participants. In other words, the users enjoyed the "philosophic mindedness". According to the statistical analysis, Scopus's database has retrieved the most r and semi-relevant resources and the Science Direct database has retrieved the least pertinent, semi-related resources. Thus, the Scopus, Sage, ProQuest, Web of knowledge and Science Directories databases have the highest accuracy or precision in data retrieval from the researchers’ point of view. Based on the results of the statistical analysis, the relationship between the philosophic mindedness and the demographic features of the statistical society, there is a significant relationship between the philosophic mindedness of the users and their gender. The analysis also indicated that there is a strong relationship between the accuracy of the retrieval of the statistical sample of the outcomes retrieved in the scientific research databases and their marital status. Moreover, between the accuracy of the retrieval of the statistical sample of the outcomes in all studied bases and their last degree of education is significant, and with increased education, researchers retrieve and find more relevant sources from the databases. The analysis reveals that there is a positive and significant correlation with the reliability of 99% and the level of error less than 0.01 between the variables of the philosophic mindedness of users and the accuracy of their recovery of the results retrieved from the scientific databases. Besides, the intensity of the relationship of all bases with the philosophic mindedness at the strongest level is almost the same which signifies that by raising the philosophic subjectivity of the statistical sample, they will assess the resources with a higher concentration and decision-making power which ultimately will lead to a better retrieval accuracy in the selection of related resources.Discussion: The results received from current study resembles to the outcomes of the study on the impact of the philosophic mindedness of people on facilitating decision-making. Researches have reported that individuals with a high philosophic mindedness have the right decision-making power and have a high ability to handle and run all affairs. The previous findings also signify that people with a high philosophic mindedness make good decisions to promote their performance. Related judgments in data retrieval from search engines are a kind of decision-making process in the research process that has a direct impact on the philosophic mindedness of users so people with a high philosophic mindedness have better recovery. Based on the results, there is a positive and significant relationship between the accuracy of data retrieval from scientific databases and marital status. These outcomes are in line with the findings of the research. In previous studies, there are also individual, psychological, emotional, and demographic factors of the statistical sample such as gender, age, marital status, education, and occupation of the statistical sample had a positive and significant relationship with the accuracy of data retrieval from the databases.
Elahe Bahmaee; Mohammad Reza Farhadpoor
Abstract
Background and Objectives: The advent of World Wide Web (WWW) in 1990s that was followed by emergence of a large number of web pages made using of automatic information retrieval systems necessary. The first web search engine with a capability of full text search was developed by Brian Pinkerton in the ...
Read More
Background and Objectives: The advent of World Wide Web (WWW) in 1990s that was followed by emergence of a large number of web pages made using of automatic information retrieval systems necessary. The first web search engine with a capability of full text search was developed by Brian Pinkerton in the Washington University. The Web Crawler was able to index the plain texts and allowed the users to search the expressions on the internet. Later, the Lycos, Infoseek in 1994, Excite and Yahoo in 1995 Inktomi in 1996, Google in Sep. 1997 (Gross, 2015), M.S.N and Overture (Sahu, Mahapatra and Balabantaray, 2016) were emerged to overcome the complexity resulted from a surge in the information within the web. According to Wu and Lee (2004), the services delivering by web search play a significant role for those users who seek to elicit information sources to meet their needs which had not been available for them before. Today, the search engines are recognized as an access pass to a huge size of information on the internet, providing the services and tools tomeet a variety of users' information requirements. For this reason, the evaluation of efficiency and performance of search engines is very important because it is necessary for developers and users. (Azimzadeh, Badie and Esnaashari, 2016). As far as Mc Carthy (2006: quoted in Ewans, 2007) claims, a vast majority of the people visiting the web sites access to the webpages or contents of interest through search engines rather than link directly from other pages.The search engine is a software by which the users search the needed various information on the internet as well as retrieval of related outcomes (Mivule, 2017). Craft, Metzler and Strohman (2015) define a search engine as a practical use of techniques retrieving information within the large scale text sets that with different forms reflect those capabilities for whichthey are designed purposely. On the other word, the search engines are the programs that are employing to find the documents matched to the specific keywords on the WWW setting and retrieve a list of documents containing the searched keywords (Khorsheed, Madbouly and Guirguis, 2015). According to Craft, et. al (2015) the capabilities of recovering information are dramatically involving in the structured multimedia documents, meaningful textual contents and other media, relevance, evaluation, information needs, the effective ranking algorithms and interaction with the users, what are still concerned the researchers investigating in retrieval of information. From Ali, Jhandir, Lee, On and Choi (2017) viewpoints, while the data performance for internet acts a s a fuel to back running it, its extensiveness has caused much problems for the users.While the degree of the users' confidence to the search engines and relying on it to display authentic outcomes is questionable, providing suitable, relevant, and high quality information for the users using webpages contents and links between pages is a big challenge for service providers (the search engines). It is while Xu, Wang and Goh (1998) believe that the numerous search engines have been developed to give technically better performance. It indicates that there has been lack of expected qualitative features fromusers 'viewpoints.Given that in the various studies a few and mostly specific measures such as precision and recall have been considered, on the basis of this assumption that a relevant document collection is ever the same without involvement of user, in the present study the relative precision, relative recall, F-number, coverage ratio, freshness ratio, the expected search ratio and failure are investigated as a set of measures evaluating the retrieval efficiency of the search engines in the information and knowledge domain. Therefore, the primary purpose of this study is to determine the retrieval efficiency of the five search engines given the indicators of interest. Meanwhile, the secondary purpose of this article is to identify the retrieval efficiency of the search engines based on such indicators as the relative precision, relative recall, F-number, coverage ratio, freshness ratio, the expected search ratio and failure. The main question is that in this domain how the web search engines operate efficiently to retrieve the information.Methodology: The present study in term of target is applied one and is descriptive in term of survey method. By taking into account the search function and search term as identical condition (the query AND pdf), the authors have evaluated the efficiency of the search engines based on what is observed in the retrieval results. To measure the variables, several formulas related to relative precision, relative recall, F-number, coverage ratio, freshness ratio, the expected search ratio and failure have been used. Alexa-based ranking, this study also investigates such search engines as Google, Bing,Yahoo, Ask and AOL as the most commonly used search engines. In this research, the data was collected through library studies, in order to write a research literature, checklists and through direct observation of the searched outcomes. For this purpose, based on article, Top Trends in Academic Libraries2106, published in the SAGE database, Collage Research Libraries journal, the new domains and future subfields in the information science were determined in 15 terms/ keywords. Then, the 15 terms/keywords were analyzed and to determine the search keywords they were assigned to 20 experts of the field to give a weighted score in order to be placed in the list of this study by a value given to them. Then, 5 keywords with higher weights were selected for searching. The 5 keywords were individually assigned to the 5 search engines each. In the next stage, out of total retrieved outcomes of each search engine for each keyword, 50 documents were placed on the top of the list to evaluate the retrieval efficiency measures in the researcher-made collection. In order to distinguish relevance of documents, according to Zhang, Xu, Wang and Lee (2006)'recommendations, the keyword iteration in the documents, abstracts and their keywords was considered as a measure. To assess the reliability of retrieved results, the retest method was used. For this purpose, over two phases, at 15-day interval (winter 2017) the search and retrieval were conducted again. The correlation results of the two experiments were tested and confirmed at the R=0.89. To analyze the data, the Excel 2013 was employed.Findings: The results showed that the search engines of Google, Ask and Yahoo are of better performance than the other search engines in term of relative precision, relative recall, F-number harmonic mean and failure criterion; the expected search coefficient indicators and freshness ratio; coverage ratio respectively. However, in spite of the results obtained for the study search engines based on the different indicators, in general they are not in ideal situation where in most cases they are lower than the average. Given that in the previously studied search engines, the scientific domain and indicators of efficiency were different than the present one and they have focused mostly on the relative precision and relative recall, it is not possible to compare all the findings. However, the findings from the present study based on indicator of relative precision are consistent with that of studies of Shafi and Rader (2005), Ali and Gole (2016) and on the recall with that of Janson and Molina (2006), Kumar and Prakash (2009), Wang et. Al (2012), Ali and Gole (2016) and on the indicator of coverage ratio with that of Mohammad Ismael and Mansoor Kiakie (2011), Esfandyari Moghaddam (2012) and Janson and Molina (2006).Discussion: It can be concluded that in spite of the search engines have been gained a score and ranked in this study but they are still far from ideals. It follows that the challenges related to evaluation of information retrieval efficiency, despite of using different search engines, implementation of various strategies and different ranking algorithms and methods of document indexing, has to be yet removed and they need more studies. It should be noted that the comparison of the results from this study with some prior findings indicates that neither of search engines can alone meet the required efficiency indicators. Thus, given the different indicators the users have to assign their queries to the search engines. On the one hand, designing the specific search engines with regard to diversity, extent and lexical relationship in the different domains of sciences is very necessary. On the other hand, it is the time to use various patterns such as visual searching, using multilingual thesaurus, retrieving based on weighed indexing in the interface of both specific and general search engines.
Saeedeh Khalilian; Asefeh Asemi
Abstract
Background and Objectives: The purpose of the study was the possibility of using an expert system decision based on the MCDM techniques to help in the retrieval and selection of resource information in IRANDOC. The main problem of the current study was that end-users do not have the same knowledge ...
Read More
Background and Objectives: The purpose of the study was the possibility of using an expert system decision based on the MCDM techniques to help in the retrieval and selection of resource information in IRANDOC. The main problem of the current study was that end-users do not have the same knowledge and experience in familiarizing them with databases and subject-specific skills. Also, users are faced with Persian language problems and errors instructions for information retrieval. On the other hand, the large amount of information retrieved will cause confusion and waste of time for users. Therefore, the best solution is to use a decision making system. This research attempted to feasibility of using the decision-making expert sub-system in the IRANDOC database. Methodology: In this research was used a descriptive-survey method. To collect data, a researcher-made questionnaire was used in part of the research. In the first stage, scientific literature were reviewed and were identified the criteria for assessing and selecting information resources in IRANDOC. Then expert opinions were received about the identified criteria and were finalized them. In the next step, these criteria were placed in the questionnaire to evaluate them. The questionnaire was sent electronically to PhD students in the field of Knowledge and Information Science in Iran. They were asked to score points from 1 to 9 based on a two-by-one preferential judgment. The number 9 had the highest score and the number 1 had the lowest score. Next, an open interview was conducted with the seniors of IRANDOC database to answer the last sub-question of the research. The researcher called on IRANDOC experts to use the experience of technical experts and to use their ideas and ideas. First, the explanations given on the subject under study. Then they were asked about the possibility of joining the expert sub-system to retrieve information resources with a fuzzy approach on IRANDOC. Finally, the data was analyzed by AHP method & Excel software was used for the calculation, drawing charts and graphs. Findings: It is found that Ranking Criterions Based on AHP method as follows: up to date the Prefer 0.196429, Documentary the Prefer 0.173154, Output status the Prefer 0.1164145 & Value and quality of resources the Prefer 0.0270342 was respectively in First to fourth Priority Also Specialist database was predicted possible the link decision based on fuzzy MCDM techniques to help in the retrieval and selection of resource information. There are certain criteria for choosing printed information sources that are responsible for making decisions based on these criteria. There are some differences in the selection of Internet and digital information resources. The easy dissemination of information and the large amount of information resources in information systems has created a variety and access to information for users. IRANDOC also has a web-based information system. Many Persian science information resources are available through the Web. Users are confronted with a lot of problems in IRANDOC when they retrieve information resources related to their information needs. They need to the intelligent tools that help them retrieve and select the information resources. In this research, the criteria were identified and prioritized based on the MCDM fuzzy technique. According to the findings, these criteria were categorized into four categories: the value and quality of information resources; the status of the output of information resources; the timeliness of information resources; and the documentation of information resources. They were measured and prioritized with AHP technique. The findings showed that the availability of information resources was first priority given the results with a preference of 0.196429. Then documentary information resources with 0.173154 preferences of 0.1164145 and the value and quality of the information source with 0.0270342 preference were placed in the second, third and fourth priorities, respectively. Discussion:It was concluded that according to the criteria studied, "timeliness of information resources" has a role in data retrieval. Subsequently, the criteria for documenting, the status of output, and then the value and quality of information resources in selecting and retrieving information resources are prioritized. According to the findings of the research, 10 sub criteria were selected for information retrieval. These ten criteria include credibility of the information resources, the reliability of the information resource, the quality of the information retrieved, the type of retrieved information format, the availability of full text information resources, the language of the information resources, the type of information retrieved (book, article, etc.).The date of publication / release date of information resources, information resources, statistics of citation reports of information resources. In prioritizing these criteria using the AHP technique, the "update of information source" ranked first in the "source of information source" in the second place and the "statistics of citation reports of the source of information" ranked third. It was concluded that the most important factor in selecting of information resource in information retrieval is the up-to-date resource. Perhaps the reason for this conclusion is that today information has become a fundamental human need. The advancement of human civilization depends on the use of information at a convenient time and place. Researchers are trying to access the new and up-to-date information of the world and be able to compete with their rivals. They are always looking for the latest information resources. Therefore, it is imperative that the databases provide users with the facilities and the best resources available to users. IRANDOC can use an expert system for retrieve and select information resources. To achieve this aim, it should go a long way in to join an expert ideal system.
Asefeh Asemi; Parisa Shafiei Alavijeh