With the increase in users of social media websites such as IMDb, a movie website, and the rise of publicly available data, opinion mining is more accessible than ever. In the research field of language understanding, categorization of movie reviews can be challenging because human language is complex, leading to scenarios where connotation words exist. Connotation words have a different meaning than their literal meanings. While representing a word, the context in which the word is used changes the semantics of words. In this research work, categorizing movie reviews with good F-Measure scores has been investigated with Word2Vec and three different aspects of proposed features have been inspected. First, psychological features are extracted from reviews positive emotion, negative emotion, anger, sadness, clout confidence level and dictionary words. Second, readablility features are extracted; the Automated Readability Index ARI, the Coleman Liau Index CLI and Word Count WC are calculated to measure the reviewās understandability score and their impact on review classification performance is measured. Lastly, linguistic features are also extracted from reviews adjectives and adverbs. The Word2Vec model is trained on collecting 50,000 reviews related to movies. A self-trained Word2Vec model is used for the contextualized embedding of words into vectors with 50, 100, 150 and 300 pretrained Word2Vec model converts words into vectors with 150 and 300 dimensions. Traditional and advanced machine-learning ML algorithms are applied and evaluated according to performance measures accuracy, precision, recall and F-Measure. The results indicate Support Vector Machine SVM using self-trained Word2Vec achieved 86% F-Measure and using psychological, linguistic and readability features with concatenation of Word2Vec features SVM achieved may be subject to copyright. Discover the world's research25+ million members160+ million publication billion citationsJoin for free Citation Khan, Rizwan, A.;Faisal, Ahmad, T.; Khan, G. Identiļ¬cation of ReviewHelpfulness Using Novel Textual andLanguage-Context 2022,10, 3260. Editors Nebojsa Bacaninand Catalin StoeanReceived 15 August 2022Accepted 5 September 2022Published 7 September 2022Publisherās Note MDPI stays neutralwith regard to jurisdictional claims inpublished maps and institutional Ā© 2022 by the MDPI, Basel, article is an open access articledistributed under the terms andconditions of the Creative CommonsAttribution CC BY license of Review Helpfulness Using Novel Textual andLanguage-Context FeaturesMuhammad Shehrayar Khan 1, Atif Rizwan 2, Muhammad Shahzad Faisal 1, Tahir Ahmad 1,Muhammad Saleem Khan 1and Ghada Atteia 3,*1Department of Computer Science, COMSATS University Islamabad, Attock Campus,Islamabad 43600, Pakistan2Department of Computer Engineering, Jeju National University, Jeju-si 63243, Korea3Department of Information Technology, College of Computer and Information Sciences, Princess Nourah BintAbdulrahman University, Box 84428, Riyadh 11671, Saudi Arabia*Correspondence geatteiaallah the increase in users of social media websites such as IMDb, a movie website, andthe rise of publicly available data, opinion mining is more accessible than ever. In the research ļ¬eldof language understanding, categorization of movie reviews can be challenging because humanlanguage is complex, leading to scenarios where connotation words exist. Connotation words havea different meaning than their literal meanings. While representing a word, the context in whichthe word is used changes the semantics of words. In this research work, categorizing movie reviewswith good F-Measure scores has been investigated with Word2Vec and three different aspects ofproposed features have been inspected. First, psychological features are extracted from reviewspositive emotion, negative emotion, anger, sadness, clout conļ¬dence level and dictionary readablility features are extracted; the Automated Readability Index ARI, the ColemanLiau Index CLI and Word Count WC are calculated to measure the reviewās understandabilityscore and their impact on review classiļ¬cation performance is measured. Lastly, linguistic featuresare also extracted from reviews adjectives and adverbs. The Word2Vec model is trained on collecting50,000 reviews related to movies. A self-trained Word2Vec model is used for the contextualizedembedding of words into vectors with 50, 100, 150 and 300 pretrained Word2Vecmodel converts words into vectors with 150 and 300 dimensions. Traditional and advanced machine-learning ML algorithms are applied and evaluated according to performance measures accuracy,precision, recall and F-Measure. The results indicate Support Vector Machine SVM using self-trainedWord2Vec achieved 86% F-Measure and using psychological, linguistic and readability features withconcatenation of Word2Vec features SVM achieved neural network; Word2Vec; Natural Language Processing; sentiment classiļ¬cationMSC 68T50; 68T071. IntroductionSentiment analysis is also known as opinion mining. The Natural Language ProcessingNLP technique is used to identify the sentiment polarity of textual data. It is one of thefamous research areas in NLP topics. Peopleās attitudes and thoughts about any movie,events or issue are analyzed with sentiment analysis of reviews. Sentiment analysis ofreviews classiļ¬es the review as having a positive or negative polarity that helps the userdecide about a product or any movie. While large volumes of opinion data can provide anin-depth understanding of overall sentiment, they require a lot of time to process. Not onlyis it time-consuming and challenging to review large quantities of texts, but some textsmight also be long and complex, expressing reasoning for different sentiments, making itchallenging to understand overall sentiment quickly once a new kind of communicationMathematics 2022,10, 3260. Mathematics 2022,10, 3260 2 of 20has been started between a customer and a service provider. People share their opinionabout services through websites. Usually, online products have thousands of reviews, andit is very difļ¬cult for the customers to read every review. Excessive and improper use ofsentiment in reviews makes them unclear concerning a product and it becomes difļ¬cult forcustomers to make the right decision. This entailed a Few-Shot Learner novel approachapplied for NLP tasks, including review sentiments, but focusing less on the impact ofinļ¬uential textual features [1]. In this scenario, sentiment-based review classiļ¬cation isa challenging research problem. Sentiment analysis is a hot topic due to its applicationsquality improvement in products or services, recommendation systems, decision makingand marketing research [2]. The major contributions in the research are as followsā¢The proposed psychological features are positive emotion, negative emotion, anger,sadness, clout conļ¬dence level and dictionary words.ā¢The readability features extracted according to Automated Readability Index ARI,Coleman Liau Index CLI and Word Count WC are calculated to measure thereviewās understandability score.⢠The linguistic features extracted are adjectives and adverbs.ā¢The psychological, readability and linguistic features are concatenated with Word2Vecfeatures to train the machine-learning methods have been used to investigate data and convert raw data intovaluable data. One of the applications of computing is NLP [3,4]. Many advanced algo-rithms and novel approaches have improved sentiment classiļ¬cation performance, butmore productive results can be achieved if helpful textual reviews are used for sentimentclassiļ¬cation. New features are adverbs and adjectives in terms of sentiment classiļ¬ca-tion [5,6], describing the authorās sentiments. The clout feature deļ¬nes the conļ¬dence ofthe review written by the author. The review length feature determines the information thata review has and the readability feature deļ¬nes how much information can be understoodor absorbed by the user. The readability feature also determines the complexity of anyreview for the reviews are short in length, representing opinions about products or a review given by a user has an important role in the promotion of a movie [7].Most people generally search for information about a movie on famous websites such asIMDb, a collection of thousands of movies that stores data about a movieās crew, reviewsby different users, cast and ratings. Hence, surely it is not the only way to bring people tocinemas. In this regard, reviews also have an important analysis makes opinion summary in movie reviews easier by extractingsentiment given in the review by the reviewer [8]. Sentiment analysis of movie reviewsnormally includes preprocessing [9] and the feature-extraction method with appropriateselection [10], classiļ¬cation and evaluation of results. Preprocessing includes convertingall the capitalized words into lower-case words due to case sensitivity, stopping wordremoval and removing special characters that are preprocessed for classiļ¬cation. Differentfeature-extraction methods are used to extract features from the review of a movie orproduct [11]. Most feature-extraction methods are related to lexicon and statistical-basedapproaches. In statistical feature-extraction methods, the multiple words that exist inreviews represent a feature by measuring the different weighing calculations like InverseDocument Frequency IDF, Term Frequency TF and Term FrequencyāInverse DocumentFrequencyTF-IDF [12,13]. In the feature- extraction method lexicon, the extraction oftextual features from the pattern derived among the words is derived from the partsof speech of words tag [14]. The method based on lexicon extracts the semantics fromthe review by focusing on text ordering in sentiment analysis, short text and keywordclassiļ¬cation. The emotions using short text are written on social networking sites whichhave become popular. Emotions used in the review on social networking sites includeanxiety, happiness, fear, analysis of the IMDb movie review website ļ¬nds the general perspectiveof review for emotions exhibited by a reviewer concerning a movie. Most researchers Mathematics 2022,10, 3260 3 of 20are working on differentiating positive and negative reviews. In the proposed work, acontextualized word-embedding technique is used Word2Vec. It is trained on ļ¬fty thousandreviews given by IMDb movie users. The qualitative features extracted using Word2Vecthat involves pretraining and the quantitative features are extracted from LIWC withoutpretraining. Experiments on vector features with different dimensions using the Skip-Gram Method are performed and LIWC extracts the quantitative linguistic features andpsychological features. The psychological features include positive emotion, negativeemotion, anger, sadness and clout, which measure conļ¬dence level from the reviews. Thereadability features include ARI, CLI and WC. Linguistic features include adjectives statistical and lexicon-based methods extract features to increase the modelāsaccuracy. When the features are extracted from the reviews, different feature selectiontechniques are applied to the features that help extract helpful features and eliminate thefeatures that do not contribute to the effectiveness of the classiļ¬cation of sentiment analysisof reviews [15,16]. The classiļ¬cation of sentiments of reviews deļ¬nes the polarity of reviewsand classiļ¬es them as positive or negative. ML and lexical-based methods were used forsentiment analysis. ML methods have achieved high performance in academia as wellas in industry. It is a fact that ML algorithms make the classiļ¬cation performance able toachieve high performance, but data quality is important as well. Data quality can limit theperformance of any ML algorithm regardless of how much data are used to train the modelof the ML classiļ¬ers [17].2. Related WorkThere are two types of user reviews high-quality and low-quality. A high-qualityreview helps to participate in decision making, while a low-quality one reduces helpfulnessconcerning serving users. That is the reason it is necessary to consider the quality of reviewsfor large data identify the quality of reviews, many researchers consider high-quality reviewsand their helpfulness. Ordinal Logistic Regression OLR is applied to application reviewsfrom Amazon and Google Play with the feature of review length [18]. The Tobit regressionanalysis model has been applied to the dataset of TripAdvisor and Amazon book reviewsusing features of review length and word count [19]. The IMDb movie review dataset isselected for this research and serves as the dataset for sentiment classiļ¬cation. Multipletextual features are extracted using the Word2Vec model trained on reviews and LIWC inthis research helps to improve the classiļ¬cation performance of performance of sentiment analysis has been improved gradually with time byfocusing on advanced ML algorithms, novel approaches and DL algorithms. Details aregiven in brief in Table 1, describing the number of papers that achieved the best performanceconcerning review sentiments using advanced DL algorithm CNN-BLSTM was applied to the dataset of IMDb reviews andcompared with experiments on single CNN and BLSTM performance. In the dataset, wordswere converted into vectors and passed to the DL model [20]. Linear discriminant analysison Naive Bayes NB was implemented and achieved less accuracy using only thefeature of sentiwords [21].The Maximum Entropy algorithm was applied to the movie review dataset and fea-tures extracted by the hybrid feature-extraction method and achieved the highest compared to K Nearest Neighbor KNN and Naive Bayes NB. The features usedare just lexicon features positive word count and negative word count [22]. The highestaccuracy achieved for the IMDb dataset of online movie reviews was 89% because fewerdata were used 250 movie reviews concerning text documents for training purposes and100 movie reviews for testing purposes. Mathematics 2022,10, 3260 4 of 20Table 1. Summary of Accuracy achieved on the dataset of IMDb Models/ Approach Features Dataset Accuracy1CNNBLSTMCNN-BLSTMHybrid [20]Word embedding into vectors IMDb reviews Pre train model82% without the Pre train model2 LDA on Naive Bayes [21] Sentiword Net IMDb reviews Maximum Entropy [22] Sentiment words with TF IDF IMDb reviews Naive Bayes [23] Heterogeneous Features Movie review 89%5 Naive Bayes, KNN [2] Word vector sentiword Movie reviews Entailment as FewShot Learner [1] Word embedding into vectors IMDb reviews pretrainmodel7 Deep ConvolutionNeural Network [24] Vector Features IMDb Movie Reviews LSTM [25] Vector Features IMDb Movie Reviews Neural Network [26] Lexicon Features IMDb reviews 86%Heterogeneous features were extracted from the movie review to achieve the bestperformance for Naive Bayes [23]. There are also some other Amazon datasets publiclyavailable with many non-textual features. Furthermore, many researchers have also workedon an Amazon dataset, analysing reviews using non-textual features, which include productfeatures, user features and ratings [27,28]. The above literature concludes that to improvethe performance of the model features, the size of the dataset plays a more important role;only the use of an efļ¬cient algorithm is not sufļ¬cient to improve the performance of this experimentation dataset of 5331 positive and 5331 negative processed snippetsor sentences, the sentences are labelled according to their polarity. The total number ofsentences used for training purposes is 9595 sentences or snippets and 1067 sentences areused to test the model. First, the pretrained Word2Vec is used for feature extraction andthen Convolutional Neural Network CNN is applied to these features extracted fromWord2Vec. The Google News dataset contains 3 million words on which Word2Vec istrained to achieve the embedding of words into vectors. Testing accuracy is achieved onthe test dataset and is [24].In this paper, three datasets are used; the ļ¬rst dataset consists of 50 thousand reviews25 thousand are positive, and 25 thousand are negative. The data are already separated inthe form of training and testing reviews in which the ratio of positive and negative numbersof reviews is the same. The ļ¬rst drawback of this experimentation is that the dataset is notselected for training and testing of randomized models, which bringsbias to this paper. Thesecond dataset used in the experiments is 200 movies, each having ten categories in DoubanMovies. The rating of movies was from 0 to 5. A movie rating of 1 to 2 was considered anegative review and a 3 to 5 movie rating is considered a positive review of the movie. Thecomments that had a rating of 3 were ignored. So, there were 6000 used as training andthe other 6000 were used to test the dataset. The total number of comments achieved afterremoving neutral reviews was 12,000. The second drawback is that in this paper, the ratiois 5050 and most of the references show that 8020 or 7030 is the best ratio for splittingthe dataset. For evaluating the classiļ¬cation performance, three classiļ¬ers are used forsentiment classiļ¬cation. One is NB, an extreme learning machine and LSTM is conductedbefore that dataset is passed through Word2Vec for word embedding. The word vectorswere sent to LSTM for classiļ¬cation and the results show that LSTM performed better thanother classiļ¬ers. The LSTM F-Measure was [25]. The last reference mentioned in Mathematics 2022,10, 3260 5 of 20Table 1, shows that the accuracy achieved by NN is 86% using lexicon features. This alsoapplies to neural networks. In the IMDb dataset of movie reviews used in this research,reviews are normalized using the following steps All the words of reviews are convertedto lower case from upper case words or characters. Secondly, numbers are removed, specialcharacters, punctuational marks and other diacritics are removed. White spaces includedin the review were also removed. Finally, abbreviations are expanded and stop words inreviews are also removed. All the processing of reviews involved in the referred paper isdescribed above [26].Word Embedding Using the Word2Vec ApproachWhile representing a word, the context in which the word is used matters a lot becauseit changes the semantics of words. For example, consider the word ābankā. One meaning ofthe word bank is a ļ¬nancial place, and another is land alongside water. If the word ābankāis used in a sentence with words such as treasury, government, interest rates, money, etc.,we can understand by its context words its actual meaning. In contrast, if the context wordsare water, river, etc., the actual meaning in this case of context word is land. One of theemerging and best techniques we know for word embedding is used in many ļ¬elds suchNLP, biosciences, image processing, etc., to denote text using different models. The resultsusing word embedding are shown in Table 2. Word2Vec results in other ļ¬elds ResultsImage Processing [29] 90% accuracyNatural Language Processing Tasks [30] More than 90% accuracyRecommendation Tasks [31] Up to 95% accuracyBiosciences [32] More than 90% accuracySemantics Task [33] More than 90% accuracyMalware Detection Tasks [34] Up to 99% accuracyWord embedding is most important and efļ¬cient nowadays in terms of representing atext in vectors without losing its semantics. Word2Vec can capture the context of a word,semantic and syntactic similarity, relation with other words, etc. Word2Vec was presentedby Tomas Mikolov in 2013 at Google [35]. Word2Vec shows words in a vector space. Thewords in the review are represented in the form of vectors and placement is carried out sothat dissimilar words are located far away and similar meaning words appear together invector Proposed MethodologyThe proposed methodology, the environment of hardware and software was set asneeded to perform experiments. The hp laptop core i5 4th generation having 8 GB RAMis used for experimentation. The Google Colab software is used and is the IntegratedDevelopment Environment for the Python language in which we peformed our the latest libraries of Python are used for experiments. The research methodologyconsisted of four steps. The steps are dataset acquisition, feature engineering, models andevaluation, shown in Figure 1below. Figure 1deļ¬nes that after preprocessing of dataacquisition from the IMDb movie review website, it is passed for feature engineering, whichconsists of three blocks B, C and D. B, C and D blocks are used independently as well asin hybrid; B and C, and B and D blocks are named Hybrid-1 and Hybrid-2, E consists of 10-fold cross-validation, training and testing of different ML modelsand the last one is the evaluation process of models. After extraction of features, eachfeature is normalized using the Min/Max Normalization technique. On the normalizedfeature, 10-fold cross-validation is applied to remove the bias. Machine-learning ML anddeep-learning DL models are trained and tested; these are Support Vector Machine SVM,Naive Bayes NB, Random Forest RF, Logistic Regression LR, Multi-Layer Perceptron Mathematics 2022,10, 3260 6 of 20MLP, Convolution Neural Network CNN and Bidirectional Gated Recurrent Unit Bi-GRU. The results were achieved after implementing models on features and were Review Dataset AcquisitionLinguistic Inquiry and Word CountWord2Vec model training and word Embedding Pretrained glove modelMin/Max Normalization Hybrid A+B Hybrid A+B10 k fold stratified cross validationSVMNBRandom ForestLogistic RegressionCNNBiGRUAccuracyRecallPrecisionF1 ScoreComparison AB C DEFigure 1. General Diagram of working ļ¬ow of Research Dataset AcquisitionThe benchmark of the movie review dataset from IMDb is collected and availablepublicly. The main dataset exists of 50,000 reviews with polarity levels. The ground ratingis also available according to the 10-star rating from different customers. A review with arating of less than 4 is a negative review, and a review with a score of more than seven is apositive review. All the reviews are equally pre-divided into 25,000 positive reviews andthe other 25,000 negative. Each review is available in the text document. Fifty-thousandtext documents containing reviews were Preprocessing for Feature ExtractionAfter downloading, each text document including reviews is preprocessed by usingPYCHARM IDE. In two columns, all the reviews and their polarity are read and written inthe Comma Separated Value CSV ļ¬le. One column indicates the reviews and the secondcolumn indicates the polarity. Firstly, the reviews in sentences tokenized into words andthen all the special characters, stop words and extra spaces are removed from the reviewusing the NLP tool kit library available in Python. The preprocess reviews are written upin the preprocess column of the CSV ļ¬le for future Data Preprocessing ToolFor data preprocessing, we use the tool PYCHARM 2018 IDE and Python version Natural Language Tool Kit NLTK is used for text processing such as tokenizationand stop word removal. Google Colab is used for implementing DL algorithms because itprovides GPU and TPU for fast processing. Mathematics 2022,10, 3260 7 of Feature Feature Extraction Using LIWCThe LIWC consists of multiple dictionaries to analyze and extract the features. Toextract psychological, textual and linguistic features from the movie review dataset, LIWCis used. First, the reviews are preprocessed and then used to extract features, as describedin Figure 2. The diagram ļ¬ow is deļ¬ned as the preprocessed reviews passing sent to LIWCfor extraction of the feature. LIWC compares each word of review from its dictionariesto check which category the given review word belongs to. It calculates the percentageby counting the number of words in the review that belong to a speciļ¬c category anddivides by the total number of reviews. The division result is multiplied by 100 to obtain apercentage as described in Equation 1.x=Count the number o f words in review that bel ong to s peci f ic categoryTotal numb er o f words i n review s Ć100 1xdenotes the speciļ¬c subcategory of features in LIWC. The features calculated byLIWC are positive emotionPE, negative emotionNE, angerAng, sadness, clout,dictionary wordsDic, adverbsAdvand adjectivesAdj.PE,NE,Ang,SadandCloutare categorized byLIWCas psychological categorized aslinguistic ReviewsLowercase ,Remove stop words, Extra spaces, special characters, LemmatizationLIWCExtract FeaturesLinguistic/Summary LanguagePsychologicalReadabilityEPositive Emotion, Negative Emotion, Anger, Sad, clout, Adjective, Adverb, Dictionary Words ARI, CLI, Word CountBFigure 2. Feature Engineering with 3shows that after the extraction of features, Min/Max Normalization is appliedand then passes through block E for further implementation, including 10-fold cross-validation, training of ML models, testing of ML models and last is evaluation. Mathematics 2022,10, 3260 8 of 20InmovieheroplaygoodWt Wt-2 Wt-1 Wt+1 Wt+2 Wt+3 Sci-fiction000100Input Layer Hidden Layer Output Layer Example In Sci-fiction movie hero play good role ever and the Window size 7Figure 3. General Diagram of working ļ¬ow of Research Readability Feature ExtractionThe readability score of reviews deļ¬nes the effort required to understand the text ofreviews. The three readability features are calculated on the preprocessed reviews ARI,CLI and word is used for measuring the readability of English text and it is calculated by usingthe formula given in Equation 2.ARI = ĆCW+ ĆWSā 2where Crepresents characters that counts letters and numbers in review, Wrepresentswords and the number of spaces in review. Srepresents sentences that is the number ofsentences in Iscores deļ¬ne how difļ¬cult text is to understand and it is calculated by using theformula given in Equation 3.CL I = 3whereLrepresents the average number of letters per 100 words and S represents theaverage number of sentences per 100 words to measure the understandability of a count WC is calculated by linguistic inquiry word count which consists ofmultiple dictionaries and is calculated with Equation 4.WordCount =Nallwords āN punctuationāNstopwords āNnonalpha 4where Nallwords represents the total number of words in the review text, Npunctuationrepresents the number of punctuation characters in the review text, Nstopwords representsthe number of stop words in the review text and Nnonalpha represents the number ofnon-alphabetic terms in the review the extraction, each readability feature of Min/Max Normalization is applied, asdescribed in the next Word Embedding by Review-Based Training of Word2Vec ModelThe features of movie reviews are extracted by training the Word2Vec neural sequence of the feature-extraction process is given in Figure 4below. Firstly, for Mathematics 2022,10, 3260 9 of 20training, the neural model of Word2Vec data is prepared using the dataset of IMDb moviereviews with 50 thousand reviews. The total number of words included in this dataset is6 review was used in the training of the Word2Vec neural model and three differentembedding sizes were used in experiments, 50, 100 and 150, with a context size of 10. Thereare two methods for training the Word2Vec neural model; one is the COBOW context ofthe bag of words and the second one is the Skip-Gram Method. We used the Skip-GramMethod, which focuses on less frequent words and gives good results concerning wordembeddings of less frequent words. Skip-Gram Method operations are given in Figure 3deļ¬nes that the model is trained by deļ¬ning the window size 10 and Skip-Gram computes word embedding. Instead of using context words as input to predict thecenter word like a context bag of words, it used the center word as input and predictsthe center wordās context words. For example, āIn Sci-ļ¬ction movie hero play good roleāwith context size 7. Training instances are created such as āInā is the target word which isthe input and the context word āSci-ļ¬ction movie hero play the good roleā is the outputword. The training instances are given in Table 3. Using training samples deļ¬ned above inthe table used for training the neural network, the result of word embedding is generatedfor each word given in the vocabulary. The trained model is saved and movie reviewspass to these models for converting words into vectors. Three different types of vectorshaving sizes of 50, 100 and 150 are created. For classiļ¬cation, Word2Vector features areused measured by Skip-Gram Method passed to block ReviewsLowercase ,Remove stop words, Extra spaces, special characters, Lemmatization50 Thousand Reviews6,142,469 wordsVocabularyTrain Word2vec Neural ModelTrained Word2vec ModelTrained Word2vec ModelTrained Word2vec Model50 embedding sizeContext size 10 100 embedding sizeContext size 10150 embedding sizeContext size 10Skip gram Method Skip gram Method Skip gram MethodTest Model Test Model Test ModelVector50 Vector100 Vector150EConvert Sentences into wordsC1CFigure 4. Feature Extraction Process with self Pretrained Word2Vec 3. Word2Vec Results in other ļ¬eldsā OutputIn sci-ļ¬ctionIn movieIn heroIn playIn goodIn roleIn ever Mathematics 2022,10, 3260 10 of Word Embedding by Pretrained Word2Vec ModelThe Glove Model is an unsupervised learning algorithm used for vector representationof words. Training samples are taken from Wikipedia and different books. The GloveModel uses a generalized kind of text on which it is trained. Figure 5describes the stepsfor word embedding into ļ¬rst step is to download the Glove Model in the zip ļ¬le with 150 vectors and300 vectors. The pretrained Glove Model is loaded and passed for test on the preprocessedreviews. Each preprocess review consists of words and is passed to the test model as inputand output are received as the vector of each review by taking the average of vectors. Eachreview vectors has 150 and 300 numbers in review vectors. The output of the vector ispassed to the E block for further implementation, which includes 10-fold cross-validation,training ML models, testing ML models and Million Vocabulary Wikipedia +books Preprocessed ReviewsConvert Sentences into wordsTrained glove Model 300Trained glove Model 150Test trained ModelTest trained ModelVectors 150 Vectors 300ETaking Mean of vectors Taking Mean of vectors DFigure 5. Feature Extraction Process with Pretrained Word2Vec Evaluation and DatasetThe dataset selected for the experiment is IMDb movie reviews, consisting of 50,000 re-views of different movies with sentiment polarity. The reason for this dataset selection isthat it is the largest number of reviews compared to the previously uploaded dataset ofmovie reviews on the website accessedon 4 April 2022. A total of 25,000 reviews are positive and the other 25,000 thousandreviews are negative. Each review is in the text ļ¬le so in the zip ļ¬le 50,000 text ļ¬les areincluded with their rating value from 1 to 10 as text ļ¬lename. Mathematics 2022,10, 3260 11 of Feature Exploration and Hypothesis TestingIn this subsection, the linguistic, psychological and readability features extracted fromthe reviews and used in the sentiment-based review classiļ¬cation are explored. A summaryof the descriptive statistics of the features under each category linguistic, psychologicaland readability are provided in Table 4. This summary includes the number of datarecords N, mean, median, standard deviation SD, maximum Max and minimum Minvalues of the features under each category. Moreover, the signiļ¬cance of the featuresrelated to the three categories is examined using hypothesis testing. In order to select theright signiļ¬cance test, the normality of the features is examined. To obtain a sense of thedistributions of features and outcome variable, histograms and associated distributioncurves are plotted as depicted in Figure 6. It is noteworthy that only CLI has a well behavedbell-shaped distribution curve while all other features are not. To conļ¬rm this observation,normal probability plots for all features are provided in Figure 7. A normal probability plotdemonstrates the deviation of record distribution from normality. It has been observed thatthe Adv, Adj and Clout distributions deviate slightly from normal distribution. However,all other feature distributions except CLI are not normally investigate the association between input features, a correlation matrix is the probability distributions of most features are not Gaussian, it is not possible to usePearson correlation to check the relationship between features. In contrast, the Spearmancorrelation coefļ¬cient is an efļ¬cient tool to quantify the linear relationship between con-tinuous variables that are not normally distributed [36]. As this is the case of our inputfeatures, Spearman correlation has been adopted in this study to quantify the associationbetween the features. A heat map of the Spearman correlation coefļ¬cient is created andpresented in Figure 8. The circleās size is indicative of the strength of bivariate correlation map of Figure 8reveals a strong relationship between anger and negativeemotions and between ARI and CLI features, and a moderate association between NE andsadness and between Dic and ARI and CLI. However, the map shows weaker associationbetween the other input features. As the outcome, polarity class, is a categorical variable,the correlation coefļ¬cient is not an adequate tool to measure its association with the inputfeatures. Therefore, Binomial Logistic Regression LR has been adopted to investigate thisassociation. Logistic Regression assesses the likelihood of an input feature being linked toa discrete target variable [37]. The input features do not exhibit high multicollinearity, asdeducted from the correlation matrix plot of Figure 8, which makes the LR a suitable test ofassociation for our problem. Table 5displays the output of a Binomial Logistic Regressionmodel that was ļ¬tted to predict the outcome based of the linguistic, psychological andreadability feature values. The p-values and signiļ¬cance levels for each of the regressionmodelās coefļ¬cients are listed in Table 5. The asterisks denote the level of the featureāssigniļ¬cance; more asterisks imply a higher level of importance. If the associated p-valueis less than three asterisks are used to denote signiļ¬cance, two asterisks are used torepresent signiļ¬cance if the corresponding p-value is in the range [ one asteriskreļ¬ects a p-value between and and no asterisk is for p-values larger than Asshown in Table 5, the p-values for PE, NE, Ang, Sad, Clout, Adj and CLI indicate that thesefeatures are statistically signiļ¬cant to the polarity class. Mathematics 2022,10, 3260 12 of 20Figure and probability distribution curves for linguistic, physiological, readabilityfeatures and polarity class probability plots for linguistic, physiological, readability features and polarity classvariables. Mathematics 2022,10, 3260 13 of 20Table statistics summary of linguistic, psychological, readability features and NE Ang Sad Clout Dic Adv Adj WC ARI CLI PolarityMean 175 0 0 0 0 0 0 12 ā ā 0Max 99 1304 1N2000 2000 2000 2000 2000 2000 2000 2000 2000 2000 2000 2000Table signiļ¬cance of linguistic, psychological and readability features using BinomialLogistic S-Error t-Statistics p-Value QualityPE ā ā **NE ***Ang ***Sad *Clout ***Dic **WC ā ā ***Figure Correlation coefļ¬cient matrix of linguistic, psychological and readability Chi-square hypothesis test is conducted to verify the sufļ¬ciency of the LR modelto test a featureās signiļ¬cance. The null hypothesis of the test, H0, assumes that there isno relationship between the response variable, the polarity and any of the input features, all model coefļ¬cients except the intercept are zero. On the other hand, the alternativehypothesis, H1, implies that if any of the predictorās coefļ¬cients is not zero, then thelearning model is called efļ¬cient. The p-value of the Chi-square test of the model wasrecorded as 1988 degrees of freedom on 2000 observations for all indicates that the LR model differs statistically from a constant model with only theintercept term and can be considered as an adequate test of feature signiļ¬cance. As aresult, the null hypothesis can be rejected, and the association between the input featuresin predicting the polarity of a review is conļ¬rmed. As depicted in Table 4, the binomial LRreveals that all psychological features are signiļ¬cant. However, only Adj from the linguistic Mathematics 2022,10, 3260 14 of 20features and CLI from the readability features are signiļ¬cant. Therefore, only signiļ¬cantfeatures are used for review classiļ¬cation in this Evaluation Measure and Performance ComparisonThe evaluation of the deep-learning and conventional models is carried out by calcu-lating the performance measures accuracy, precision, recall and F-Measure. These perfor-mance measures are calculated on the basis of a confusion matrix. The details of confusionmatrixes are given Confusion MatrixA confusion matrix is also known as an error matrix and is used for measuring theperformance of a classiļ¬cation model. A confusion matrix is represented in Figure a review is an actual negative, and the model is predicted as positive it is calledfalse positive FP. When a review is an actual positive, and the model is predicted to bepositive, it is called true positive TP. When a review is an actual positive, and the model ispredicted as negative, it is called false negative FN. When a review is an actual negative,and the model is predicted as negative, it is called true negative TN.TP FPFN TNPositive1 Negative0Positive1Negative0Actual ValuesPredicted ValuesFigure 9. Confusion Pretrained Word EmbeddingThe pretrained word embedding Glove experimented with two different words em-bedding word vector dimensions 150 and 300. The 6 ML classiļ¬ers are used with 150 wordvector dimensions and each word vector is tested. The experiments with 150 and 300 wordvector dimension and their results are shown in Tables 6and 6. Results of pretrained model of vector dimension of 150 Accuracy Precision Recall F ScoreMulti-LayerPerceptron NearestNeighborā Forest Bayes VectorMachine Mathematics 2022,10, 3260 15 of 20Table 7. Pretrained model of vector dimension 300 Training Accuracy Average Testing Accuracy AverageCNN the movie review dataset preprocessing, it is passed to the 10-fold stratiļ¬ed cross-validation for the unbiased splitting of the dataset. The Glove pretrained model for featureengineering process is used. The 150 dimensions of the Glove pretrained model are used asa feature for ML models. The six ML algorithms are applied and SVM achieves the bestresults concerning other algorithms NB, RF, LR, KNN and MLP on the evaluation measuresof accuracy, precision, recall and F-Measure. The highest F-Measure score achieved is SVM, which is the impact of the pretrained Glove Model with 150 dimensions offeature vectors. The ML algorithm performs better on the 150 dimension vector of MLP, three layers are used with 20 neurons at each layer to predict review impact of the pretrained Glove Model having 300 dimensions is represented inTable 7. The two DL models are applied to features having a vector dimension of used models are CNN and Bi-GRU and the best results are achieved with Bi-GRUwith testing accuracy. The lowest dimension of the pretrained model is 150, whichleaves a higher impact on the results using the traditional ML algorithm compared to the300 dimensions using the DL Review-Based Trained Word2Vec Model Word EmbeddingThe reviews are embedded into vectors with three different word vector size dimen-sions, 50, 100 and 150. Then, the ML and DL algorithms are applied to varying sizes ofvectors of 28 dimensions independently and evaluated. Finally, the results are shown inTable 8based on the 8. Trained Model on reviews with 50 word vector dimension evaluation Accuracy Precision Recall F ScoreNaive Bayes Forest VectorMachine 50 dimensions of the Word2Vec model are self-trained on movie reviews. Afterthat self-trained model, it is used for word embedding of the movie reviews into vectorsrepresenting the meaning of that word. Then, the six ML algorithms are applied. The SVMachieves the best results compared to other algorithms, NB, RF, LR, KNN and MLP, on theevaluation measures accuracy, precision, recall and F-Measure. The highest F Measure scoreachieved is using SVM with 50 word embedding dimension, which is the impact ofthe self-trained model with a smaller number of dimensions. In Table 9, the 100 dimensionparameter of the self-trained model is evaluated using a confusion matrix. Mathematics 2022,10, 3260 16 of 20Table 9. Without the pretrained model with a 100 word vector dimension evaluation Accuracy Precision Recall F ScoreNaive Bayes NearestNeighbor Forest VectorMachine 100 dimensions of the Word2Vec model are self-trained on movie reviews. Afterthat model is self-trained, it is used for word embedding of the movie reviews into vectorsrepresenting the meaning of that word. Then, the six ML algorithms are applied. The SVMachieves the best results compared to other algorithms, NB, RF, LR, KNN and MLP on theevaluation measures accuracy, precision, recall and F-Measure. The highest F-Measurescore achieved is using SVM with 100 word embedding dimensions, which is theimpact of the self-trained model with a higher number of dimensions than previous Table 10, the impact of 150 dimensions of the self-trained model is trained on reviews of 150 word vector dimension without psychological, linguisticand readability features evaluation Accuracy Precision Recall F ScoreNaive Bayes NearestNeighbor Forest VectorMachine 150 dimensions of the Word2Vec model are self-trained on movie reviews. First,the context size of the model is set to 10 and the Skip-Gram Method is used to train theWord2Vec model. After that model is self-trained, it is used for word embedding of themovie reviews into vectors representing the meaning of that word. Then, the six MLalgorithms are applied. The SVM achieves the best results compared to other algorithms,NB, RF, LR, KNN and MLP on the evaluation measures accuracy, precision, recall andF-Measure. The highest F-Measure score achieved is using SVM with 150 wordembedding dimensions, which is the impact of the self-trained model with a higher numberof dimensions than the previous 50 and 100 dimension results. In Table 11, the impactof 150 dimensions of the self-trained model in addition to psychological, linguistic andreadability features is deļ¬ned. The 150 dimension self-trained model with proposedfeatures is considered because it shows better results than the pretrained Glove psychological features are extracted using LIWC. Next, the psychological featuresused in this experiment are positive emotion, negative emotion, anger, sadness, clout anddictionary words. CLIās readability feature is used because it gave a better result in theprevious experiment. Mathematics 2022,10, 3260 17 of 20Table trained on reviews of 150 word vector dimension with psychological, linguistic andreadability features evaluation Accuracy Precision Recall F ScoreNaive Bayes NearestNeighbor Forest VectorMachine the six ML algorithms are applied. The SVM achieves the best results withrespect to other algorithms, NB, RF, LR, KNN and MLP, on the accuracy, precision, recall andF-Measure evaluation measures. The highest F-Measure score achieved is using psychological, linguistic and readability features improve the evaluation 12 shows the impact of 300 dimensions of the self-trained model concerning results on word embedding 150 word vectors with psychological and Training Average Accuracy Testing AccuracyCNN 2 Layers evaluation result of two DL algorithms applied to 300 dimension vectors withoutpsychological and readability features. The impact on accuracy of 300 dimensions of theself-trained model is higher than the 300 dimensions of the pretrained model. The resultsshow that the method of embedding that is context-based gives higher results with respectto global based embedding. The applied models are CNN with two layers with 32 and64 neurons, respectively. Bi-GRU is used, which has two gates; one is an updated gate andthe other is a reset gate. The update gate is used to the retain memory and the reset gateis used to forget memory. The best results are achieved with Bi-GRU with testingaccuracy as compared to the pretrained Glove evaluation results of two DL algorithms applied on 300 word vectors with psy-chological and readability features are given in Table results on word embedding 300 word vectors with psychological, linguistic andreadability Training Accuracy Average Testing Accuracy AverageCNN the psychological features are extracted using LIWC. The psychological featuresused in this experiment are positive emotion, negative emotion, anger, sadness, clout anddictionary words. CLIās readability feature gave a better result in the previous applied models are CNN with two layers with 32 and 64 neurons, Bi-GRU has two gates; one is an updated gate and the other is a reset gate. The updatedgate is used to retain the memory and the reset gate is used to forget the memory. Bi-GRUachieves the best results with testing accuracy compared to the pretrained GloveModel. In Table 14, a comparison is given between the proposed work and the previouswork based on evaluation measures. Mathematics 2022,10, 3260 18 of 20Table 14. Comparison of F-Measure of Proposed work with Previous Embedding Model Classiļ¬er F-MeasureReview based trainedWord2Vec Support Vector Machine Word2Vec [16] CNN-BLSTM Word2Vec [22] LSTM [18] Maximum Entropy analysis of the results following the experiment is given below.ā¢The self-trained Word2Vec model on movie reviews with 150 dimension parameterhas a higher impact on performance than the pretrained Glove Model.⢠The CLI readability achieved the highest score compared to ARI and WC.ā¢The SVM algorithm performs better than the applied algorithms NB, LR, RF, CNN,KNN and MLP.ā¢The use of the psychological and readability feature CLI to classify reviews withself-trained embedding improves the performance from 86% to smaller number of words embedding dimension 150 performs better concerningthe traditional ML algorithm and for the DL algorithm 300 dimensions gives a ConclusionsClassiļ¬cation of opinion mining of reviews is open research due to the continuousincrease in available data. Many approaches have been proposed to achieve classiļ¬cationof movie reviews. After a critical analysis of the literature, we observe that words areconverted into vectors for sentiment classiļ¬cation of movie reviews by different approaches,including TF-IDF and Word2Vec. The pretrain model of Word2Vec is commonly used forword embedding into vectors. Mostly generalized data are used to train the Word2Vecmodel for extracting features from reviews. We extract features by training the Word2Vecmodel on speciļ¬c data related to 50 thousand reviews. For review classiļ¬cation, theWord2Vec model is trained on reviews. Most researchers used a generalized trained modelfor review classiļ¬cation as an alternative. This research work extracts features from moviereviews using a review-based trained Word2Vec model and LIWC. The review-basedtrained data have some characteristics. They include 6 million vocabularies of the wordand are speciļ¬c to movie reviews related to the task of sentiment classiļ¬cation of six ML algorithms are applied, and SVM achieves the best result of F-Measurewith respect to other algorithms NB, RF, LR, KNN and DL algorithms are also applied. One is CNN and the other is Bi-GRU. Bi-GRUachieved which is greater than the results CNN achieved. The results conclude thatthe data used for model training perform better than the model trained on generalized the ML algorithm,150 features perform better than 50 and 100 features for theused movie review dataset. The DL model 300 feature vectors perform better classiļ¬cationsthan the 150 feature vectors. Signiļ¬cant psychological, linguistic and readability featuresaided in improving the classiļ¬cation performance of the used classiļ¬ers. SVM achievedan F-Measure with 150 word vector size and BiGRU achieved the same F-Measurescore using 300 word vector size. We applied both traditional ML and DL algorithmsfor the classiļ¬cation of reviews. Both achieved nearly the same results on a performancemeasure that proves that the dataset of IMDb movie reviews having 50,000 is not enoughfor applying a DL algorithm. In future work, a larger dataset is needed to apply the DLalgorithm to increase the classiļ¬cation performance of ContributionsConceptualization, Muhammad Shehrayar Khan, Saleem Khan, Muhammad Shehrayar Khan, and methodology, Muhammad Shehrayar Khan, Muhammad Saleem Khan, and Mathematics 2022,10, 3260 19 of software Muhammad Shehrayar Khan, Muhammad Saleem Khan, and validation, Muhammad Shehrayar Khan, MuhammadSaleem Khan, and formal analysis, Muhammad Shehrayar Khan, and investigation, Muhammad Shehrayar Khan, MuhammadSaleem Khan, and resources, Muhammad Shehrayar Khan; data curation, Saleem Khan; writing original draft preparation, Muhammad Shehrayar Khan, Muhammad Saleem Khan, and writing review and editing, Shehrayar Khan, Muhammad Saleem Khan, and Muhammad Shehrayar Khan, Muhammad Saleem Khan, and supervision, project administration, funding acquisition, and authors have read and agreed to the published version of the This research received no external Availability StatementI declare that the data considered for this research is original andcollected by the authors for generating insights. Moreover, the data mining & ML tools consideredfor this research are freely available and built the models in accordance with our own of InterestThe authors declare that there is no conļ¬ict of interest related to this Wang, S.; Fang, H.; Khabsa, M.; Mao, H.; Ma, H. Entailment as Few-Shot Learner. arXiv 2021, arXiv Nguyen, Dao, Sentiment Analysis of Movie Reviews Using Machine Learning Techniques. InProceedings of Sixth International Congress on Information and Communication Technology, London, UK, 25ā26 February 2021;Springer Berlin, Germany, 2022; pp. 361ā U.; Khan, S.; Rizwan, A.; Atteia, G.; Jamjoom, Samee, Aggression Detection in Social Media from Textual DataUsing Deep Learning Models. Appl. Sci. 2022,12, 5083. [CrossRef] T.; Faisal, Rizwan, A.; Alkanhel, R.; Khan, Muthanna, A. Efļ¬cient Fake News Detection Mechanism UsingEnhanced Deep Learning Model. Appl. Sci. 2022,12, 1743. [CrossRef] Rizwan, A.; Iqbal, K.; Fasihuddin, H.; Banjar, A.; Daud, A. Prediction of Movie Quality via Adaptive Voting Access 2022,10, 81581ā81596. [CrossRef] A.; Abbas, Y.; Ahmad, T.; Mahmoud, Rizwan, A.; Samee, A Healthcare Paradigm for Deriving KnowledgeUsing Online Consumersā Feedback. Healthcare 2022,10, 1592. [CrossRef] A.; Agrawal, A.; Rath, Classiļ¬cation of sentiment reviews using n-gram machine learning approach. Expert 2016,57, 117ā126. [CrossRef] Mohamed, Haggag, A survey on opinion summarization techniques for social media. Future J. 2018,3, 82ā109. [CrossRef] I.; Varma, Govardhan, A. Preprocessing the informal text for efļ¬cient sentiment analysis. Int. J. Emerg. TrendsTechnol. Comput. Sci. IJETTCS 2012,1, 58ā Shenoy, Mohan, Aspect term extraction for sentiment analysis in large movie reviews using Gini Indexfeature selection method and SVM classiļ¬er. World Wide Web 2017,20, 135ā154. [CrossRef] B.; Lee, L.; Vaithyanathan, S. Thumbs up? Sentiment classiļ¬cation using machine learning techniques. arXiv2002,arXivcs/ Bakar, Yaakub, A review of feature selection techniques in sentiment analysis. Intell. Data 159ā189. [CrossRef] M.; Harish, B. A New Feature Selection Method based on Intuitionistic Fuzzy Entropy to Categorize TextDocuments. Int. J. Interact. Multimed. Artif. Intell. 2018,5, 106. [CrossRef] M.; Brooke, J.; Toļ¬loski, M.; Voll, K.; Stede, M. Lexicon-based methods for sentiment analysis. Comput. 267ā307. [CrossRef] A.; Zhang, D.; Levene, M. Combining lexicon and learning based approaches for concept-level sentiment of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining, Beijing, China, 12 August2012; pp. 1ā L.; Wang, H.; Gao, S. Sentiment feature selection for sentiment analysis of Chinese online reviews. Int. J. Mach. 2018,9, 75ā84. [CrossRef] S.; Kar, Baabdullah, A.; Al-Khowaiter, Big data with cognitive computing A review for the future. Int. J. 2018,42, 78ā89. [CrossRef]18. Fink, L.; Rosenfeld, L.; Ravid, G. Longer online reviews are not necessarily better. Int. J. Inf. Manag. 2018,39, 30ā37. [CrossRef] L.; Goh, Jin, D. How textual quality of online reviews affect classiļ¬cation performance A case of deep learning sentimentanalysis. Neural Comput. Appl. 2020,32, 4387ā4415. [CrossRef] Mathematics 2022,10, 3260 20 of Z. Sentiment Analysis of Movie Reviews based on Machine Learning. In Proceedings of the 2020 2nd InternationalWorkshop on Artiļ¬cial Intelligence and Education,Montreal, QC, Canada, 6ā8 November 2020; pp. 1ā Karim, M.; Das, S. Sentiment analysis on textual reviews. IOP Conf. Ser. Mater. Sci. Eng. 2018,396, 012020. [CrossRef] H.; Harish, B.; Darshan, H. Sentiment Analysis on IMDb Movie Reviews Using Hybrid Feature Extraction Method. Int. Multimed. Artif. Intell. 2019,5, 109ā114. [CrossRef] R. Sentiment analysis of movie reviews using heterogeneous features. In Proceedings of the 2018 2nd InternationalConference on Electronics, Materials Engineering & Nano-Technology IEMENTech, Kolkata, India, 4ā5 May 2018; pp. 1ā Chaurasia, S.; Srivastava, Sentiment short sentences classiļ¬cation by using CNN deep learning model withļ¬ne tuned Word2Vec. Procedia Comput. Sci. 2020,167, 1139ā1147. [CrossRef] Liu, Luo, X.; Wang, L. An LSTM approach to short text sentiment classiļ¬cation with word embeddings. InProceedings of the 30th conference on computational linguistics and speech processing ROCLING 2018, Hsinchu, Taiwan, 4ā5October 2018; pp. 214ā Z.; Zulļ¬qar, Xiao, C.; Azeem, M.; Mahmood, T. Sentiment analysis on IMDB using lexicon and neural Appl. Sci. 2020,2, 1ā10. [CrossRef] A.; Mukhopadhyay, S.; Panigrahi, Goswami, S. Utilization of oversampling for multiclass sentiment analysis onAmazon review dataset. In Proceedings of the 2019 IEEE 10th International Conference on Awareness Science and TechnologyiCAST, Morioka, Japan, 23ā25 October 2019; pp. 1ā A.; Akhilesh, V.; Aich, A.; Hegde, C. Sentiment analysis of restaurant reviews using machine learning techniques. InEmerging Research in Electronics, Computer Science and Technology; Springer Berlin, Germany, 2019; pp. 687ā Ghosh, Valveny, E.; Harit, G. Beyond visual semantics Exploring the role of scene text in image Recognit. Lett. 2021,149, 164ā171. [CrossRef] L.; Wang, G.; Zuo, Y. Research on patent text classiļ¬cation based on Word2Vec and LSTM. In Proceedings of the 2018 11thInternational Symposium on Computational Intelligence and Design ISCID, Hangzhou, China, 8ā9 December 2018; Volume 1,pp. 71ā Q.; Dong, H.; Wang, Y.; Cai, Z.; Zhang, L. Recommendation of crowdsourcing tasks based on Word2Vec semantic tags. Mob. Comput. 2019,2019, 2121850. [CrossRef] PeƱa, Breis, San RomĆ”n, I.; Barriuso, Baraza, Snomed2Vec Representation of SNOMED CTterms with Word2Vec. In Proceedings of the 2019 IEEE 32nd International Symposium on Computer-Based Medical SystemsCBMS, Cordoba, Spain, 5ā7 June 2019; pp. 678ā A.; Khatua, A.; Cambria, E. A tale of two epidemics Contextual Word2Vec for classifying twitter streams duringoutbreaks. Inf. Process. Manag. 2019,56, 247ā257. [CrossRef] T.; Mao, Q.; Lv, M.; Cheng, H.; Li, Y. Droidvecdeep Android malware detection based on Word2Vec and deep beliefnetwork. KSII Trans. Internet Inf. Syst. TIIS 2019,13, 2180ā T.; Chen, K.; Corrado, G.; Dean, J. Efļ¬cient estimation of word representations in vector space. arXiv2013,arXiv C.; Dehon, C. Inļ¬uence functions of the Spearman and Kendall correlation measures. Stat. Methods 497ā515.[CrossRef]37. Collett, D. Modelling Binary Data; CRC Press Boca Raton, FL, USA, 2002. ResearchGate has not been able to resolve any citations for this healthcare agencies HHCAs provide clinical care and rehabilitation services to patients in their own homes. The organizationās rules regulate several connected practitioners, doctors, and licensed skilled nurses. Frequently, it monitors a physician or licensed nurse for the facilities and keeps track of the health histories of all clients. HHCAsā quality of care is evaluated using Medicareās star ratings for in-home healthcare agencies. The advent of technology has extensively evolved our living style. Online businessesā ratings and reviews are the best representatives of organizationsā trust, services, quality, and ethics. Using data mining techniques to analyze HHCAsā data can help to develop an effective framework for evaluating the finest home healthcare facilities. As a result, we developed an automated predictive framework for obtaining knowledge from patientsā feedback using a combination of statistical and machine learning techniques. HHCAsā data contain twelve performance characteristics that we are the first to analyze and depict. After adequate pattern recognition, we applied binary and multi-class approaches on similar data with variations in the target class. Four prominent machine learning models were considered SVM, Decision Tree, Random Forest, and Deep Neural Networks. In the binary class, the Deep Neural Network model presented promising performance with an accuracy of However, in the case of multiple class, the random forest model showed a significant outcome with an accuracy of Additionally, variable significance is derived from investigating each attributeās importance in predictive model building. The implications of this study can support various stakeholders, including public agencies, quality measurement, healthcare inspectors, and HHCAs, to boost their performance. Thus, the proposed framework is not only useful for putting valuable insights into action, but it can also help with retrieval from huge social web data is a challenging task for conventional search engines. Recently, information filtering recommender systems may help to find movies, however, their services are limited because of not considering movie quality aspects in detail. Prediction of movies can be improved by using the characteristics of social web content about a movie such as social-quality, tag quality, and a temporal aspect. In this paper, we have proposed to utilize several features of social quality, user reputation and temporal features to predict popular or highly rated movies. Moreover, enhanced optimization-based voting classifier is proposed to improve the performance on proposed features. Voting classifier uses the knowledge of all the candidate classifiers but ignores the performance of the model on different classes. In the proposed model, weight is assigned to each model based on its performance for each class. For the optimal selection of weights for the candidate classifiers, Genetic Algorithm is used and the proposed model is called Genetic Algorithm Voting GA-V classifier. After labeling the suggested features by using a fixed threshold, several classifiers like Bayesian logistic regression, NaĆÆve Bayes, BayesNet, Random Forest, SVM, Decision Tree, LSTM and AdaboostM1 are trained on MovieLens dataset to find high-quality/popular movies in different categories. All the traditional ML models are compared with GA-V in terms of precision, recall and F1 score. The results show the significance of the proposed features and proposed GA-V KhanSalabat KhanAtif Rizwan Nagwan AbdelsameeIt is an undeniable fact that people excessively rely on social media for effective communication. However, there is no appropriate barrier as to who becomes a part of the communication. Therefore, unknown people ruin the fundamental purpose of effective communication with irrelevantāand sometimes aggressiveāmessages. As its popularity increases, its impact on society also increases, from primarily being positive to negative. Cyber aggression is a negative impact; it is defined as the willful use of information technology to harm, threaten, slander, defame, or harass another person. With increasing volumes of cyber-aggressive messages, tweets, and retweets, there is a rising demand for automated filters to identify and remove these unwanted messages. However, most existing methods only consider NLP-based feature extractors, TF-IDF, Word2Vec, with a lack of consideration for emotional features, which makes these less effective for cyber aggression detection. In this work, we extracted eight novel emotional features and used a newly designed deep neural network with only three numbers of layers to identify aggressive statements. The proposed DNN model was tested on the Cyber-Troll dataset. The combination of word embedding and eight different emotional features were fed into the DNN for significant improvement in recognition while keeping the DNN design simple and computationally less demanding. When compared with the state-of-the-art models, our proposed model achieves an F1 score of 97%, surpassing the competitors by a significant spreading of accidental or malicious misinformation on social media, specifically in critical situations, such as real-world emergencies, can have negative consequences for society. This facilitates the spread of rumors on social media. On social media, users share and exchange the latest information with many readers, including a large volume of new information every second. However, updated news sharing on social media is not always this study, we focus on the challenges of numerous breaking-news rumors propagating on social media networks rather than long-lasting rumors. We propose new social-based and content-based features to detect rumors on social media networks. Furthermore, our findings show that our proposed features are more helpful in classifying rumors compared with state-of-the-art baseline features. Moreover, we apply bidirectional LSTM-RNN on text for rumor prediction. This model is simple but effective for rumor detection. The majority of early rumor detection research focuses on long-running rumors and assumes that rumors are always false. In contrast, our experiments on rumor detection are conducted on real-world scenario data set. The results of the experiments demonstrate that our proposed features and different machine learning models perform best when compared to the state-of-the-art baseline features and classifier in terms of precision, recall, and F1 growth of social networking web users, people daily shared their ideas and opinions in the form of texts, images, videos, and speech. Text categorization is still a crucial issue because these huge texts received from the heterogeneous sources and different mindset peoples. The shared opinion is to be incomplete, inconsistent, noisy and also in different languages form. Currently, NLP and deep neural network methods are widely used to solve such issues. In this way, Word2Vec word embedding and Convolutional Neural Network CNN method have to be implemented for effective text classification. In this paper, the proposed model perfectly cleaned the data and generates word vectors from pre-trained Word2Vec model and use CNN layer to extract better features for short sentences find out what other people think has been an essential part of information-gathering behaviors. And in the case of movies, the movie reviews can provide an intricate insight into the movie and can help decide whether it is worth spending time on. However, with the growing amount of data in reviews, it is quite prudent to automate the process, saving on time. Sentiment analysis is an important field of study in machine learning that focuses on extracting information of subject from the textual reviews. The area of analysis of sentiments is related closely to natural language processing and text mining. It can successfully be used to determine the attitude of the reviewer in regard to various topics or the overall polarity of the review. In the case of movie reviews, along with giving a rating in numeric to a movie, they can enlighten us on the favorableness or the opposite of a movie quantitatively; a collection of those then gives us a comprehensive qualitative insight on different facets of the movie. Opinion mining from movie reviews can be challenging due to the fact that human language is rather complex, leading to situations where a positive word has a negative connotation and vice versa. In this study, the task of opinion mining from movie reviews has been achieved with the use of neural networks trained on the āMovie Review Databaseā issued by Stanford, in conjunction with two big lists of positive and negative words. The trained network managed to achieve a final accuracy of 91%.Duc Duy Tran Sang NguyenTran Hoang Chau DaoSentiment analysis is the interpretation and classification of emotions and opinions from the text. The scale of emotions and opinions can vary from positive to negative and maybe neutral. Customer sentiment analysis helps businesses to point out the publicās thoughts and feelings about their products, brands, or services in online conversations and feedback. Natural language processing and text classification are crucial for sentiment analysis. That means we can predict or classify customersā opinions given their comments. In this paper, we do sentiment analysis in the two different movie review datasets using various machine learning techniques including decision tree, naĆÆve Bayes, support vector machine, blending, voting, and recurrent neural networks RNN. We propose a few frameworks of sentiment classification using these techniques on the given datasets. Several experiments are conducted to evaluate them and compared with an outstanding natural language processing tool Stanford CoreNLP at present. The experimental results have shown our proposals can achieve higher performance, especially, the voting and RNN-based classification models can result in better with visual and scene text content are ubiquitous in everyday life. However, current image interpretation systems are mostly limited to using only the visual features, neglecting to leverage the scene text content. In this paper, we propose to jointly use scene text and visual channels for robust semantic interpretation of images. We not only extract and encode visual and scene text cues but also model their interplay to generate a contextual joint embedding with richer semantics. The contextual embedding thus generated is applied to retrieval and classification tasks on multimedia images with scene text content to demonstrate its effectiveness. In the retrieval framework, we augment the contextual semantic representation with scene text cues to mitigate vocabulary misses that may have occurred during the semantic embedding. To deal with irrelevant or erroneous scene text recognition, we also apply query-based attention to the text channel. We show that our multi-channel approach, involving contextual semantics and scene text, improves upon the absolute accuracy of the current state-of-the-art methods on Advertisement Images Dataset by in the relevant statement retrieval task and by 5% in the topic classification task.
Overthe period we review, the field has seen major advances regarding the automated detection of strongly obfuscated and thus hard-to-identify forms of academic plagiarism. These improvements mainly originate from better semantic text analysis methods, the investigation of non-textual content features, and the application of machine learning.
TextAnalytics is the process of converting unstructured text data into meaningful data for analysis, to measure customer opinions, product reviews, feedback, to provide search facility, sentimental analysis and entity modeling to support fact based decision making. Text analysis uses many linguistic, statistical, and machine learning techniques.
Textmining is an automatic process that uses natural language processing to extract valuable insights from unstructured text. By transforming data into information that machines can understand, text mining automates the process of classifying texts by sentiment, topic, and intent. Thanks to text mining, businesses are being able to analyze
. pm1ysy54c2.pages.dev/782pm1ysy54c2.pages.dev/307pm1ysy54c2.pages.dev/407pm1ysy54c2.pages.dev/368pm1ysy54c2.pages.dev/966pm1ysy54c2.pages.dev/503pm1ysy54c2.pages.dev/506pm1ysy54c2.pages.dev/141pm1ysy54c2.pages.dev/307pm1ysy54c2.pages.dev/894pm1ysy54c2.pages.dev/130pm1ysy54c2.pages.dev/445pm1ysy54c2.pages.dev/642pm1ysy54c2.pages.dev/242pm1ysy54c2.pages.dev/573
language features of review text