Prelims

N. Padmaja Dr. (SRI Padmavati Mahila Visvavidyalayam, India)
Rajalakshmi Subramaniam Dr. (Talaash Research Consultants, India)
Sanjay Mohapatra Dr. (Batoi Systems Pvt Ltd, India)

Big Data Analytics for the Prediction of Tourist Preferences Worldwide

ISBN: 978-1-83549-339-7, eISBN: 978-1-83549-338-0

Publication date: 22 February 2024

Citation

Padmaja, N., Subramaniam, R. and Mohapatra, S. (2024), "Prelims", Big Data Analytics for the Prediction of Tourist Preferences Worldwide, Emerald Publishing Limited, Leeds, pp. i-xiv. https://doi.org/10.1108/978-1-83549-338-020241009

Publisher

:

Emerald Publishing Limited

Copyright © 2024 N. Padmaja, Rajalakshmi Subramaniam and Sanjay Mohapatra. Published under exclusive licence by Emerald Publishing Limited


Half Title Page

Big Data Analytics for the Prediction of Tourist Preferences Worldwide

Title Page

Big Data Analytics for the Prediction of Tourist Preferences Worldwide

By

Dr. N. Padmaja

SRI Padmavati Mahila Visvavidyalayam, India

Dr. Rajalakshmi Subramaniam

Talaash Research Consultants, India

And

Dr. Sanjay Mohapatra

Batoi Systems Pvt Ltd, India

United Kingdom – North America – Japan– India – Malaysia – China

Copyright Page

Emerald Publishing Limited

Emerald Publishing, Floor 5, Northspring, 21-23 Wellington Street, Leeds LS1 4DL

First edition 2024

Copyright © 2024 N. Padmaja, Rajalakshmi Subramaniam and Sanjay Mohapatra.

Published under exclusive licence by Emerald Publishing Limited.

Reprints and permissions service

Contact: www.copyright.com

No part of this book may be reproduced, stored in a retrieval system, transmitted in any form or by any means electronic, mechanical, photocopying, recording or otherwise without either the prior written permission of the publisher or a licence permitting restricted copying issued in the UK by The Copyright Licensing Agency and in the USA by The Copyright Clearance Center. Any opinions expressed in the chapters are those of the authors. Whilst Emerald makes every effort to ensure the quality and accuracy of its content, Emerald makes no representation implied or otherwise, as to the chapters' suitability and application and disclaims any warranties, express or implied, to their use.

British Library Cataloguing in Publication Data

A catalogue record for this book is available from the British Library

ISBN: 978-1-83549-339-7 (Print)

ISBN: 978-1-83549-338-0 (Online)

ISBN: 978-1-83549-340-3 (Epub)

List of Figures and Tables

Chapter 1
Fig. 1.1. Sources of Big Data. 5
Chapter 3
Fig. 3.1. Design of Proposed System Flow Diagram. 28
Chapter 4
Fig. 4.1. Predicting Preferences of International and Domestic Tourists Using Association Rule Mining Proposed System Flow Diagram. 41
Fig. 4.2. Excellent Feature. 46
Fig. 4.3. Clean Feature. 47
Fig. 4.4. Comfortable Feature. 47
Fig. 4.5. Friendly Feature. 48
Fig. 4.6. Negative Feature. 49
Fig. 4.7. Expensive Feature. 49
Fig. 4.8. Small Feature. 50
Fig. 4.9. Good Hotel Feature. 51
Fig. 4.10. Hot Water Feature. 52
Fig. 4.11. Helpful Staff Feature. 52
Fig. 4.12. Good Location Feature. 53
Fig. 4.13. Good Value Feature. 54
Fig. 4.14. Features of Domestic New Delhi Hotels. 55
Fig. 4.15. Features of Beijing Hotels. 55
Fig. 4.16. Features of Chicago Hotels. 56
Fig. 4.17. Features of Dubai Hotels. 56
Fig. 4.18. Features of London Hotels. 57
Fig. 4.19. International Tourism of Montreal Hotels. 57
Fig. 4.20. Features of New York Hotels. 58
Fig. 4.21. Features of San Francisco Hotels. 58
Fig. 4.22. Features of Shanghai Hotels. 59
Fig. 4.23. Features of Vegas Hotels. 59
Fig. 4.24. Results of International Tourism Beijing, Shanghai, Las Vegas and Chicago Hotels. 61
Fig. 4.25. Results of International Tourism Montreal, New York, New Delhi and San Francisco Hotels. 62
Fig. 4.26. Results of International Tourism London and Dubai Hotels. 63
Chapter 5
Fig. 5.1. Pointwise Mutual Information Proposed System Flow Diagram. 70
Fig. 5.2. Count of Beijing Features. 74
Fig. 5.3. Count of Chicago Features. 74
Fig. 5.4. Count of Dubai Features. 75
Fig. 5.5. Count of Las Vegas Features. 76
Fig. 5.6. Count of London Features. 76
Fig. 5.7. Count of Montreal Features. 77
Fig. 5.8. Count of New Delhi Features. 78
Fig. 5.9. Count of San Francisco Features. 78
Fig. 5.10. Count of Shanghai Features. 79
Fig. 5.11. Count of Features and Cities. 80
Chapter 6
Fig. 6.1. Multiclass Multilabel Classification Proposed System Flow Chart. 85
Fig. 6.2. Result of Beijing. 91
Fig. 6.3. Result of Chicago. 92
Fig. 6.4. Result of Dubai. 92
Fig. 6.5. Result of Las Vegas. 93
Fig. 6.6. Result of London. 93
Fig. 6.7. Result of Montreal. 94
Fig. 6.8. Result of New Delhi. 94
Fig. 6.9. Result of New York. 95
Fig. 6.10. Result of San Francisco. 95
Fig. 6.11. Result of Shanghai. 96
Fig. 6.12. Features of All Cities. 99
Chapter 7
Fig. 7.1. Loss Function of LDA Topic Modelling. 105
Fig. 7.2. Loss Function of Doc2Vec. 106
Fig. 7.3. Loss Function of TF-IDF Features. 106
Fig. 7.4. Accuracy vs. Feature Type. 107

Chapter 2
Table 2.1. Reviews of Big Data in the Tourism Sector. 17
Table 2.2. Benefits of Big Data in Tourism Sector. 20
Table 2.3. Challenges of Big Data in the Tourism Sector. 22
Table 2.4. Applications of Big Data in the Tourism Sector. 25
System Requirements. 35
Chapter 4
Table 4.1. Good Hotel Feature. 50
Table 4.2. Hot Water Feature. 51
Table 4.3. Helpful Staff Feature. 52
Table 4.4. Good Location Feature. 53
Table 4.5. Good Value Feature. 53
Chapter 6
Table 6.1. Ratings of Cities and Their Features. 96
Table 6.2. Ratings of Cities Using LDA. 97
Table 6.3. Ratings of Cities Using Doc2Vec. 97
Table 6.4. Ratings of Cities Using TF-IDF. 98
Table 6.5. Ratings of Cities With Their Features and Matrix Values in Ratings of Cities. 98

List of Abbreviations

BI

Business Intelligence

IoT

Internet of Things

LDA

Latent Dirichlet Allocation

NLP

Natural Language Processing

PMI

Ponitwise Mutual Information

TF-IDF

Term Frequency-Inverse Document Frequency

UGC

User Generated Content

Preface

The embracement of big data and machine learning techniques in tourism industry has been presumed as a mean to direct the challenges and issues of the smart development of tourism destinations and organizations by redesigning the margins of the competition and improving its pattern and design as information intensive business. In the process of their journeys and in their decision-making processes, people who travel contribute to the generation of a huge flow of data created by sensors and micro-devices disbursed on the urban and other areas of interest for travellers. All these information is a potential base for creating smart destinations and also for improving the tourism organizations' potential to customize their products and service offerings. Though, the real execution of such inventive forms of data-driven value generation in tourism continues to be more restricted to the theory or executed in a few excellent cases. Indeed, big data and machine learning techniques in tourism continues to be an unclear concept and a subject of investigation that necessitates closer analysis from an extensive range of field and research methods. Comprehensibly with this viewpoint, the current special issue aimed to deal with development of the discussion on Smart Tourism by concentrating on big data as a promising model which is re-shaping the concept and practice of tourism. In particular, the underlying principle of the study is to explore benefits, importance and challenges of big data in smart tourism with theoretical and experiential contributions and intend to derive meaning for smart tourism. Known the changes that digital era is introducing in the tourism industry and with the intention of supporting managers that function in this industry, it is significant to look into various challenges and issues included on how big data could contribute to decision-making processes in tourism organization. This issue makes an effort to cover these subjects and to consider the status of big data and smart tourism study. The study also intends to analyze the benefits of big data and various machine learning techniques evident for those in the area of tourism and to comprehend the effectiveness of, and opportunities presented within. The tourism sector is a wide business area, but the keystone of this sector is the ambience part of it and hotels are considered the influential in that field. Running a profitable business could be millstone for every tourism organization and necessitate hotel organizations to hire more staffs in which they have to afford financially and manually. Three different techniques have been used in the current study, i.e. association rule mining, point wise mutual information technique (PMI) and neural network such as multiclass and multilabel classification algorithms. These three classification models have been used in order to check the accuracy of the results, i.e. TF-IDF, Doc2Vec and LDA. In order to generate tourists' profile and information, different big data algorithms and machine learning techniques have been recommended and applied to tourism text data assessment. The study concluded that accuracy is high for the TF-IDF approach. Successful cases are analyzed by elucidating the main opportunities and challenges for the generation of official tourist information. Throughout the growth and interpretation of the knowledge of tourism, every time as a result of the poor understanding of the leader or extreme dependence on big data, there might be a misunderstanding that big data could do anything. It results in blind decision-making, and consequently many technical inputs might not bring in the desired effect. The awareness and understanding of big data for users is considered important. The use of big data is a conventional application in the marketing policy. The study contributes to literature on value generation from big data and it's relating technologies. This paper also contributed to explain the role of big data in transforming the structure of tourism industry, which is dominated by deep-rooted business logics. The study offers original contributions which progress knowledge regarding challenges and importance for smart tourism with respect to big data. It also deals with state-of-the-art technique and related practice, and emerging challenges and issues and areas for future investigations are examined in this study.