scispace - formally typeset
Search or ask a question
Author

František Dařena

Bio: František Dařena is an academic researcher from Mendel University. The author has contributed to research in topics: Natural language & Information system. The author has an hindex of 6, co-authored 34 publications receiving 112 citations.

Papers
More filters
Book
31 Oct 2019
TL;DR: In this paper, the authors provide a perspective on the application of machine learning-based methods in knowledge discovery from natural languages texts using the popular R-language with its implemented machine learning algorithms.
Abstract: This book provides a perspective on the application of machine learning-based methods in knowledge discovery from natural languages texts. By analysing various data sets, conclusions which are not normally evident, emerge and can be used for various purposes and applications. The book provides explanations of principles of time-proven machine learning algorithms applied in text mining together with step-by-step demonstrations of how to reveal the semantic contents in real-world datasets using the popular R-language with its implemented machine learning algorithms. The book is not only aimed at IT specialists, but is meant for a wider audience that needs to process big sets of text documents and has basic knowledge of the subject, e.g. e-mail service providers, online shoppers, librarians, etc. The book starts with an introduction to text-based natural language data processing and its goals and problems. It focuses on machine learning, presenting various algorithms with their use and possibilities, and reviews the positives and negatives. Beginning with the initial data pre-processing, a reader can follow the steps provided in the R-language including the subsuming of various available plug-ins into the resulting software tool. A big advantage is that R also contains many libraries implementing machine learning algorithms, so a reader can concentrate on the principal target without the need to implement the details of the algorithms her- or himself. To make sense of the results, the book also provides explanations of the algorithms, which supports the final evaluation and interpretation of the results. The examples are demonstrated using realworld data from commonly accessible Internet sources.

22 citations

Book ChapterDOI
01 Sep 2011
TL;DR: A text mining-method is described how to discover words that are significant for expressing different opinions (positive and negative) using very large sets of customers' reviews concerning the on-line hotel room booking.
Abstract: Opinions expressed by text documents freely written in various natural languages represent a valuable source of knowledge that is hidden in large datasets The presented research describes a text mining-method how to discover words that are significant for expressing different opinions (positive and negative) The method applies a simple but unified data pre-processing for all languages, providing the bag-of-words with words represented by their frequencies in the data Then, the frequencies are used by the algorithm which generates decision trees The tree decisive nodes contain the words that are significant for expressing the opinions Positions of these words in the tree represent their significance degree, where the most significant word is in the node As a result, a list of relevant words can be used for creating a dictionary containing only relevant information The described method was tested using very large sets of customers' reviews concerning the on-line hotel room booking For more than 15 languages, there were available several millions of reviews The resulting dictionaries included only about 200 significant words

16 citations

Journal ArticleDOI
TL;DR: This paper is focused on the analysis of a route planning system which could be used as a part of Supply Chain Management information system or as a standalone application, and describes basic techniques and frameworks of transportation problems as well as important functional requirements, considering recent trends in the field of distribution planning.
Abstract: Today, the demand for creating a systematic approach for managing sales, ordering, and logistics has increased Supply Chain Management (SCM) is one of the responses to problems that have arose with the need for managing complex supply chains Nowadays, most of the activities of Supply Chain Management is realized or supported with computing technologies Route planning is an important part of Supply Chain Management related to both procurement and distribution Route planning systems specify the sequences in which the selected transport vehicles should supply the demand points by requested quantities of goods at the right time The paper is focused on the analysis of a route planning system which could be used as a part of Supply Chain Management information system or as a standalone application It describes basic techniques and frameworks of transportation problems as well as important functional requirements, considering recent trends in the field of distribution planning As a result, functional specification of basic features and other components of system are provided The paper is a result of a joint initiative of the authors and a vendor of business information systems

10 citations

Journal ArticleDOI
TL;DR: A new method for chatbot platform evaluation is proposed and the proposed method for the chatbot selection is demonstrated on two sample businesses – a large bank and a small taxi service.
Abstract: Chatbots are going to be the main tool for automated conversations with customers. Still, there is no consistent methodology for choosing a suitable chatbot platform for a particular business. This paper proposes a new method for chatbot platform evaluation. To describe the current state of chatbot platforms, two high-level approaches to chatbot platform design are discussed and compared. WYSIWYG platforms aim to simplicity but may lack some advanced features. All-purpose chatbot platforms require extensive technical skills and are more expensive but give their users more freedom in chatbot design. We provide an evaluation of six major chatbot solutions. The proposed method for the chatbot selection is demonstrated on two sample businesses - a large bank and a small taxi service.

10 citations

Journal ArticleDOI
TL;DR: It has been shown that at least part of the movement of stock prices is associated with the textual content if a proper combination of processing parameters is selected.
Abstract: The paper presents the result of experiments that were designed with the goal of revealing the association between texts published in online environments (Yahoo! Finance, Facebook, and Twitter) and changes in stock prices of the corresponding companies at a micro level. The association between lexicon detected sentiment and stock price movements was not confirmed. It was, however, possible to reveal and quantify such association with the application of machine learning-based classification. From the experiments it was obvious that the data preparation procedure had a substantial impact on the results. Thus, different stock price smoothing, lags between the release of documents and related stock price changes, five levels of a minimal stock price change, three different weighting schemes for structured document representation, and six classifiers were studied. It has been shown that at least part of the movement of stock prices is associated with the textual content if a proper combination of processing parameters is selected.

7 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

Christopher M. Bishop1
01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

01 Jan 2002

9,314 citations

01 Jan 2009
TL;DR: Wang et al. as discussed by the authors developed a fixed effect log-linear regression model to assess the influence of online reviews on the number of hotel room bookings, which indicated a significant relationship between online consumer reviews and business performance of hotels.
Abstract: Despite hospitality and tourism researchers’ recent attempts on examining different aspects of online word-of-mouth [WOM], its impact on hotel sales remains largely unknown in the existing literature. To fill this void, we conduct a study to empirically investigate the impact of online consumer-generated reviews on hotel room sales. Utilizing data collected from the largest travel website in China, we develop a fixed effect log-linear regression model to assess the influence of online reviews on the number of hotel room bookings. Our results indicate a significant relationship between online consumer reviews and business performance of hotels.

877 citations