Hashtags as Crowdsourcing: A Case Study of Arabic Hashtags on Twitter

This mixed-methods study aims to highlight the impact of social media in the Arab world, specifically Twitter’s impact on translators’ communities. For this purpose, the role of hashtags among translators is examined by investigating one particular Arabic hashtag: its purpose, its target users, and the classification of its content. The hashtag is #المترجم_في_خدمة_المترجم [translator_serving_translator]. 1) An online survey of six closed questions was posted on Twitter; the 249 responses show that users come from fourteen Arab countries, the majority from Saudi Arabia. Hashtag users are translators, freelancers, or TS (translation studies) students. Some are active users who post tweets and answer questions, others only ask questions, and the rest only read tweets. The general attitude toward employing hashtags among translators’ communities was positive. 2) Using a content analysis approach, the content is classified into two main categories, sharing information and seeking assistance, each with seven subcategories.


Introduction
The last decade of the 20th century brought a revolutionary shift in technology, information, and the nature of media that challenged traditional sources of information and communication and transformed them permanently. Today, the Internet, social media, and the digital revolution all help shape the media scene for the new century. Screens have replaced the printed page, be it in the form of newspapers, books, magazines, diaries, letters, or even billboards [1]. The recent technological developments, and social media specifically, have altered how [...]
How to cite this paper: Hendal [...]
[...] with families, friends, and colleagues via computers and mobile phones. Twitter's mission is "to give everyone the power to create and share ideas and information instantly, without barriers" [8]. This microblogging platform has become a significant real-time information resource. Twitter users can post short messages (of up to 140 characters), and any Twitter user can read those messages. Twitter users can access basic data about other users, including the accounts that follow a user and the accounts that a user follows [9]. The most prominent feature of Twitter is its fast dissemination, which makes it an excellent place for posting experiences and sharing information immediately. Of all social media networks, Twitter has achieved the strongest connection between ordinary people and popular, powerful, and rich people [10]. Zhao et al. [11] claimed that Twitter, as a microblogging service, has become popular and widespread because tweets are compact and can be read quickly, which has led to Twitter being used to share breaking news, personal updates, and ideas.
According to https://twitter.com [12], the service has 328 million monthly active users, 82% of whom are on mobile. The total number of active users in the Arab world was about 5,797,500 as of March 2017, 40% of whom are in Saudi Arabia (2,400,000 users). Of the 17,198,900 tweets produced in the Arab world each day (as of March 2014), 40% are produced in Saudi Arabia, followed by Egypt (17%) and Kuwait (10%) [13].
The hashtag feature, which originated on Twitter, can be included anywhere in a tweet. A hashtag, denoted with the # symbol, is a shorthand convention adopted by Twitter users to manually assign their tweets to a wider corpus of posts on the same topic [14]. According to Small [15], a hashtag is a keyword, or keywords, assigned to information that describes a tweet. Hashtags help in searching and organizing information on Twitter, and they provide accurate and timely statistics about trending topics. Clicking or tapping on a hashtagged word or phrase in any post displays other tweets with the same hashtag [12]. For example, someone interested in a topic such as translation can find hashtags related to translation, TS, CAT (computer-assisted translation), and more. This function applies to all languages and to many social media applications in addition to Twitter, such as Instagram and Facebook. According to RadiumOne, an online advertising company, almost 75% of social media users use hashtags [16]. On Twitter, hashtags define virtual user communities whose members exchange information with other users of the same hashtag. A group of users who use the same hashtag in their tweets is known as a hashtag community [17].

Translator Service Hashtag
Translators' crowdsourcing, as in many other professions, starts with using hashtags to label and organize their professional needs among themselves, including

Design
To better understand the content and classifications of topics discussed in this hashtag, the users and their attitudes toward translators' hashtags in general and toward this hashtag in particular are considered. For this purpose, a mixed-methods approach is adopted. It combines a qualitative content analysis of the tweets under this hashtag with a quantitative survey of six questions (in addition to the demographic questions) that profiles the members of this crowdsourcing community and their attitudes toward the hashtag.

Quantitative Approach
Using Google Forms, a closed-ended online questionnaire was designed in Arabic, as the target audience was mainly Arabic-speaking. It contains six main questions, in addition to the demographic information (sex, age, major, education level, and country).
• Why are you interested in translation (translator, TS student, freelancer, or translation fan)?
• Why are you using this hashtag (to ask questions, to participate and answer questions, only to browse/read)?
• Whom does this hashtag serve most (TS student, translators, or TS researchers)?
• What are the most frequently discussed topics (seven topics)?
• Do you think this hashtag is helping you?
• Do you recommend that translators use hashtags in Twitter or in other social media?
The questionnaire was posted in June 2018 on Twitter under the same hashtag, in cooperation with Mr. Alhathloul, the founder of this hashtag, to ensure that the same users would be involved.

Qualitative Approach
To explore and classify the content of the hashtag tweets, a content analysis strategy is employed. According to Vaismoradi [18], content analysis is a systematic coding and categorizing approach that aims at investigating large amounts of textual information in order to determine trends in the words used, their frequency, and their relationships, and to describe the characteristics of the documents' content. This strategy achieves its purpose "by examining who says what, to whom, and with what effect" ([18], p. 400).
The corpus of this study is all the tweets posted under this hashtag during one randomly selected month. Kim et al. [19] compared two methods for sampling Twitter content. The first is the simple random sample method, in which researchers collect all tweets from a particular period and then randomly select a particular number of them.
In the second method, the constructed week sample, researchers select all the tweets from a particular day of the week, such as Monday, and then select one Monday out of all the available Mondays. By comparing the efficiency of the two methods, Kim et al. [19] found that the simple random sample method is more efficient and more representative for sampling Twitter content. Thus, constructed week sampling was avoided here, and all the tweets in the randomly selected month are involved in the content analysis.
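The two sampling strategies described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used in this study or by Kim et al.: the `Tweet` record, the function names, and the synthetic four-week corpus are all hypothetical, and the constructed week is built in the usual way by picking one date per weekday.

```python
import random
from collections import namedtuple
from datetime import date, timedelta

# Hypothetical tweet record: posting date plus text.
Tweet = namedtuple("Tweet", ["day", "text"])


def simple_random_sample(tweets, k, seed=0):
    """Pool all tweets from the period, then draw k of them at random."""
    return random.Random(seed).sample(tweets, k)


def constructed_week_sample(tweets, seed=0):
    """For each weekday, pick one date at random and keep all its tweets."""
    rng = random.Random(seed)
    dates_by_weekday = {}
    for tweet in tweets:
        dates_by_weekday.setdefault(tweet.day.weekday(), set()).add(tweet.day)
    chosen = {rng.choice(sorted(days)) for days in dates_by_weekday.values()}
    return [tweet for tweet in tweets if tweet.day in chosen]


# Synthetic corpus: four full weeks, three tweets per day (illustrative only).
start = date(2018, 1, 1)
corpus = [
    Tweet(start + timedelta(days=d), f"tweet {d}-{i}")
    for d in range(28)
    for i in range(3)
]

random_sample = simple_random_sample(corpus, k=10)
week_sample = constructed_week_sample(corpus)
```

With this synthetic corpus, the constructed week always contains exactly seven dates, one per weekday, whereas the simple random sample draws individual tweets from the whole period.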
Twitter advanced search and the Mediatoolkit website were used to collect all the tweets under the hashtag. The latter, unlike Twitter, enables its users to download the selected tweets with details such as time, date, author, and country of origin, together with a content analysis of the tweets and statistical reports covering all the tweets from the selected period. However, the Mediatoolkit content analysis only categorizes tweets as negative, positive, or neutral. Because our aim is to classify the content of these tweets by the topics discussed, we examine the tweets manually and classify them according to their main trends and information. Dann's [20] Twitter content classification is adopted, with some modifications based on the chosen tweets to match the focus of this study. Dann proposes six main categories of Twitter content:
a) Conversational: includes queries, referrals, actions, and responses.
b) Status: contains personal opinion or emotional status, temporal (dates and times), location, mechanical (technology), physical experiences, work, automated (e.g. games or software), and activities.
c) Pass along: involves retweeting, UGC (links to content produced by the user), and endorsement (links to content not created by the user).
d) News: includes headlines, sport news, events, and weather.
e) Phatic: contains greetings, fourth wall, broadcast, and unclassifiable (errors and half-posted sentences).
f) Spam: tweets generated without the user's consent.
Accordingly, the content examined in this paper is classified according to these six categories where applicable. Otherwise, the results show whether other categories emerge or whether some of Dann's categories are absent from this hashtag.

Qualifications and Academic Status
Most users, 183 (73.5%), have a baccalaureate degree or are undergraduate students; 51 respondents (20.5%) have a master's degree or are master's-level students, and 15 (6%) have a PhD degree.
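As a quick arithmetic check, the shares can be recomputed from the reported counts. The sketch below assumes that every one of the 249 respondents answered this question (the three counts do sum to 249); the dictionary keys are illustrative labels, not survey wording.

```python
# Reported counts per education level (labels are illustrative).
counts = {"bachelor": 183, "master": 51, "phd": 15}

total = sum(counts.values())  # 183 + 51 + 15 = 249 respondents

# Share of each group as a percentage of all respondents, to one decimal.
shares = {level: round(100 * n / total, 1) for level, n in counts.items()}

for level, share in shares.items():
    print(f"{level}: {counts[level]} of {total} = {share}%")
```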

Major
Over a third of the contributors are from a TS major (35.7%), and a further 24.1% have a TS minor. The English linguistics major accounts for 11.6% of the contributors, English literature for 8.8%, and French, English, education, and English and Literature (see note 1) account for the same percentage at 4.8%. Finally, Arabic had the fewest participants, only two (0.8%), and 4.4% selected "others" to refer to other disciplines.
Each of the six questions provided multiple choices, and the answers and percentages were coded and obtained from Google Forms. The first question dealt with why participants were interested in translation. As Figure 2 shows, the largest group of participants (47%) is interested in translation and TS in general. The next reason is that the participants (41%) work as translators as their main job, followed by individuals who are attracted to translation as a hobby (29.7%). Finally, freelancers account for 27.7%.
The second question dealt with why participants used the hashtag. Most respondents used this hashtag only to browse and read others' tweets without any interaction (69.9%). A further 28.5% of participants are active users who participate by asking questions, and 45% are more active users who tweet under this hashtag and answer the questions tweeted by others (Figure 3).
The third question dealt with who is most served by the hashtag. The participants believe that this hashtag mostly serves TS students (70.7%); 40.6% think it serves translators, and 28.1% think it helps TS researchers (Figure 4). When asked about the most frequently discussed topics under this hashtag, most respondents (76.7%) pointed out that inquiries about translating particular terms or phrases are frequent among users (Figure 5). Professional tweets, including advice and information for translators, account for 65.9%, followed by [...]
[...] (7) think it was not useful. One person chose not useful at all (0.4%; Figure 6).
Note 1: Some universities have different discipline titles, and some disciplines were added to the survey based on participants' notes.

Content Classification
As mentioned earlier, all the tweets from one month are included in this analysis. We classify the categories based on the topics discussed under the hashtag, along with an example of each category in Arabic as the source language and a back translation (BT) in English (see Table 1). Several tweets were retweeted or repeated several times, so the duplicates were excluded. As the sample shows, some Twitter users send questions by direct message instead of posting them under the hashtag, and the hashtag users who receive such questions tend to post them anonymously with the hashtag to get an accurate answer from the crowd. A related point is that the responses and comments on each tweet are not included in the results, as they do not contain the hashtag. Hence, to read the responses to any inquiry, users have to click on the tweet to display all the responses and comments. Likewise, the results exported to Excel sheets do not include any responses other than the main tweet that contains the hashtag. The source text includes only the main sentence and excludes the hashtags, links, and mentioned accounts. Table 1 presents the overall classifications of the tweets' content. All the tweets in this paper fall essentially under the same theme: language and translation. Compared to Dann's content classifications, which are based mostly on previous studies, some sub-categories are inapplicable here; others are embedded in the two main categories proposed by the author.
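The preparation steps described above (dropping retweets and repeated tweets, and keeping only the main sentence without hashtags, links, or mentioned accounts) could be sketched as follows. This is a hypothetical illustration rather than the procedure actually used: the regular expressions, the function names, and the assumption that exported retweets start with "RT @" are all ours, and the sample tweets are invented.

```python
import re

# Simple patterns for the elements excluded from the source text.
URL = re.compile(r"https?://\S+")
MENTION = re.compile(r"@\w+")
HASHTAG = re.compile(r"#\w+")


def clean(text):
    """Strip links, mentioned accounts, and hashtags; collapse whitespace."""
    for pattern in (URL, MENTION, HASHTAG):
        text = pattern.sub(" ", text)
    return " ".join(text.split())


def unique_main_tweets(tweets):
    """Drop retweets (assumed to start with 'RT @') and repeated tweets."""
    seen = set()
    kept = []
    for raw in tweets:
        if raw.startswith("RT @"):
            continue
        body = clean(raw)
        if body and body not in seen:
            seen.add(body)
            kept.append(body)
    return kept


# Invented examples, written in English for readability.
sample = [
    "How do I translate 'deadline'? #translator_serving_translator",
    "RT @user: How do I translate 'deadline'? #translator_serving_translator",
    "How do I translate 'deadline'? #translator_serving_translator https://example.com/x",
    "Workshop on Saturday @org #translator_serving_translator",
]
main_tweets = unique_main_tweets(sample)
```

In Python 3, `\w` matches Arabic letters as well as the underscore, so the same patterns would also strip Arabic hashtags written with underscores.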
The main classification of the content is based on the nature of the topics discussed in the tweets. The tweets either provide information or request information and help, regardless of the type of information (sources, websites, etc.).
Thus, the first main category is sharing information; this category involves all tweets from users who share the following translation-related information.
1) Events: This category includes information about translation events, such as conferences, workshops, training courses, translation initiatives, or even interviews with professional translators via social media applications (e.g. Snapchat). Some of the workshops and training courses were free or online, and the tweets advertised the dates or offered feedback or thanks to the organizers of the events. The information about conferences was of three types: (a) advertising a conference; (b) sharing experiences of a particular conference; and (c) sharing terminologies and expressions used in a particular conference.
2) Information Sources: This category covers shared information sources, including dictionaries, photos, PDF files of books, recommended books to read, reports and research, and URLs. These can be URLs for language learning websites, websites for checking whether a book has been translated, educational websites, videos, or reports.
3) Translation: This category covers shared translations of news, wise sayings and proverbs, terminologies, idioms (e.g. the top 50 idioms in a particular field with their translations), and Arabic translations of some phrases from other languages, such as Spanish and French.
The second category, seeking assistance, also has seven subcategories. Many users under this hashtag ask questions or ask for help with their translation, and the main subcategories include users asking for the following.
1) Academic Assistance: Users make academic inquiries. Some ask about particular programs (majors) to enroll in, topics for their dissertations, or advice about TS in international and national universities, and some share their surveys to be completed by the hashtag users.
2) Practice Assistance: Some users post tweets to ask for short videos to practice translation, and some look for other people to practice their English with via any application.
3) Consultations: They ask for consultations about the best learning centers for academic certificates such as IELTS and TOEFL.
4) Information Sources: They ask for information sources, such as books.
5) Translation Assistance: They ask for help in translating terminologies, phrases, or paragraphs.
6) Translation Assessments: They ask for feedback on their translations, sometimes for academic purposes, where they have a translation assignment to submit and want the crowd's feedback first.
7) Experiences: They request accounts of translators' experiences and problems in their profession in order to build a training course.
Not all of Dann's content classification categories [20] are covered by these two main categories, but some are. For example, Dann's conversational category, which includes queries and responses, matches the second category here, with all the sub-categories under seeking assistance, though all the conversations in the hashtag are about translation or translation-related topics. Dann's status category applies to sharing information about events, where the tweets are about work, physical experiences, and activities. The pass along category matches sharing information about information sources: Dann's "pass along" involves retweeting and user-generated content or links to content by other users, and the information sources shared under the hashtag are dictionaries, websites, and reports. Finally, Dann's news category, which includes headlines, sport news, and event news, applies to sharing information by translation, where users share translated news, though the first category, sharing information about events, also involves news about events such as conferences and workshops related to translation.
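The correspondence just described can be written down as a small lookup table. The sketch below is only one way to encode it: the names `DANN_CATEGORIES`, `CORRESPONDENCE`, and `dann_labels` are hypothetical, and only the subcategories and matches named in the text are included.

```python
# Dann's six top-level categories, as listed in the Qualitative Approach.
DANN_CATEGORIES = ["conversational", "status", "pass along", "news", "phatic", "spam"]

# The seven seeking-assistance subcategories identified in this study.
SEEKING_SUBCATEGORIES = [
    "academic assistance",
    "practice assistance",
    "consultations",
    "information sources",
    "translation assistance",
    "translation assessments",
    "experiences",
]

# (main category, subcategory) -> matching Dann categories, per the text.
CORRESPONDENCE = {
    ("sharing information", "events"): ["status", "news"],
    ("sharing information", "information sources"): ["pass along"],
    ("sharing information", "translation"): ["news"],
}
for sub in SEEKING_SUBCATEGORIES:
    CORRESPONDENCE[("seeking assistance", sub)] = ["conversational"]


def dann_labels(main, sub):
    """Return the Dann categories matched to a subcategory, if any."""
    return CORRESPONDENCE.get((main, sub), [])


# Dann categories that the comparison above does not mention.
matched = {label for labels in CORRESPONDENCE.values() for label in labels}
unmatched = [c for c in DANN_CATEGORIES if c not in matched]
```

Under this encoding, `unmatched` contains the phatic and spam categories, which the comparison does not relate to either main category.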

Discussion and Conclusions
The focus of this paper was to examine one particular Arabic hashtag that translators use for crowdsourcing: its users, their attitudes toward this hashtag, and the content classifications of the tweets. As the results show, the users are translators, TS students, or freelancers. The hashtag serves fourteen Arab countries, and the majority of users are from Saudi Arabia. This is consistent with the 2017 Arab Social Media Report [21], according to which Saudi Arabia has the highest number of Twitter users in the Arab region, with around 2.6 million users, and with the Twitter report indicating that 40% of active users in the Arab world are from Saudi Arabia. Another conceivable reason for this high number of Saudi Arabian users is that the founder of the hashtag is from, and lives in, Saudi Arabia. In addition, Saudi Arabia has a large number of universities (28), and translation is available as a major in bachelor's or master's degrees [22]. However, starting this initiative from one country and extending it to thirteen other countries is promising and confirms the users' attitudes toward this hashtag. Their answers reveal their positive attitudes toward all hashtags in general, and this is confirmed by the increasing number of Arabic hashtags serving translators and translations, such as #المترجم [translator], #عبارات-المترجم [translator's_phrases], #مهارات-ترجمية [translational_skills], #عزيزي-طالب-الترجمة [dear_translation_student], and many others. Moreover, the content revealed the creation of Arabic hashtags to serve translators for languages other than English, such as French. The suggestion was to add the initials FR after the existing hashtag #المترجم_في_خدمة_المترجم to target only translators to/from French.
The content classifications of the tweets revealed the importance of this hashtag in sharing and seeking information. The involvement of users with PhD and master's degrees, along with specialists in translation and professional translators, makes this professional community a trustworthy and reliable place to obtain the required information, to discuss and share translation-related issues, and to exchange academic and professional experiences.
Finally, the field of social media and crowdsourcing in general, and Twitter and hashtags specifically, is a significant topic that has attracted several studies from different regions. Unfortunately, the studies in this field in the Arab region are very limited compared to other regions. Given the short duration of this study and its focus on a single hashtag, several further investigations of the same target remain possible. For future studies, the information-seeking behavior of Arab translators and a further examination of the information sources they share online will be explored.

Conflicts of Interest
The author declares no conflicts of interest regarding the publication of this paper.