Loading...

Media is loading
 

Paper Type

Complete

Description

The issue of fake reviews, a perennial challenge faced by e-commerce companies, has been worsened in recent years. In this paper, we investigate the trade-off between the application of general-purpose and domain-specific model by selecting training data using transformer-based models to identify fake reviews. Therefore, we compare two scenarios using different set ups of data selection. First, a general-purpose model was identified and applied on specific domains. Afterwards, domain-specific models were trained and tested. Then, the results from these scenarios were compared, yielding the conclusion that models trained on data from a similar product category outperform general-purpose models in classification performance up to 21% (on average by 4%). Our findings send an important message to e-commerce companies to rethink their strategy on training general-purpose models to identify fake reviews to create several, more domain-specific models on that task according to present data domains.

Paper Number

1230

Comments

Social

Share

COinS
 
Aug 10th, 12:00 AM

Fake Review Detection - The Value of Domain-Specificity

The issue of fake reviews, a perennial challenge faced by e-commerce companies, has been worsened in recent years. In this paper, we investigate the trade-off between the application of general-purpose and domain-specific model by selecting training data using transformer-based models to identify fake reviews. Therefore, we compare two scenarios using different set ups of data selection. First, a general-purpose model was identified and applied on specific domains. Afterwards, domain-specific models were trained and tested. Then, the results from these scenarios were compared, yielding the conclusion that models trained on data from a similar product category outperform general-purpose models in classification performance up to 21% (on average by 4%). Our findings send an important message to e-commerce companies to rethink their strategy on training general-purpose models to identify fake reviews to create several, more domain-specific models on that task according to present data domains.

When commenting on articles, please be friendly, welcoming, respectful and abide by the AIS eLibrary Discussion Thread Code of Conduct posted here.