Autores
Rodríguez Díaz Carlos Alberto
Gelbukh Alexander
Título Measuring semantic gap between user-generated content and product descriptions through compression comparison in e-commerce
Tipo Revista
Sub-tipo JCR
Descripción Information Sciences
Resumen The significance of user-generated content as a source for business intelligence and analytics has been on the rise since the inception of electronic commerce platforms and has been solidified in the wake of the pandemic due to the prominence of electronic commerce as a sales channel. The prevailing approach to harnessing unstructured data involves the utilization of Artificial Intelligence; however, there exist simpler alternatives capable of yielding valuable information. This article introduces a methodology grounded in information theory to quantify the semantic disparity between the consumer community and product descriptions. This disparity can result in potential misunderstandings in the dialogue among consumers, and incidental costs in the dialogue between consumers and vendors. One plausible explanation for this disparity is that the terminology employed by consumers may possess different meanings compared to that utilized by product description writers. Our methodology employs large corpora of consumer reviews and product descriptions to quantify this semantic disparity across multiple electronic commerce domains through the implementation of random word exchanges and compression. Furthermore, we utilize neural word embeddings to identify specific words exhibiting the greatest semantic drift between reviews and descriptions, thereby providing lexical examples of these gaps. Our findings indicate that lower levels of lexical-semantic gap are associated with better consumer satisfaction. © 2023 Elsevier Inc.
Observaciones DOI 10.1016/j.ins.2023.118953
Lugar New York
País Estados Unidos
No. de páginas Article number 118953
Vol. / Cap. v. 638
Inicio 2023-08-01
Fin
ISBN/ISSN