An Apache open source project, Hadoop stores huge amounts of data in safe, reliable storage and runs complex queries over data in an efficient way. It is at the core of a whole host of the most popular Big Data tools. Mastering Hadoop ensures you get
Read More »Computer
Mining the Web: Discovering Knowledge from Hypertext Data
This is is the first book devoted entirely to techniques for producing knowledge from the vast body of unstructured Web data. Building on an initial survey of infrastructural issues-including Web crawling and indexing.The author examines low-level ma
Read More »Bioinformatics Data Skills: Reproducible and Robust Research with Open Source Tools
This practical book teaches the skills that scientists need for turning large sequencing datasets into reproducible and robust biological findings. Many biologists begin their bioinformatics training by learning scripting languages like Python and R
Read More »Machine Learning and Data Mining Lecture Notes
This book has been written as an introduction to the main issues associated with the basics of machine learning and the algorithms used in data mining.It offers a thorough grounding in machine learning concepts as well as practical advice on applying
Read More »Twitter Data Analytics
This book provides methods for harnessing Twitter data to discover solutions to complex inquiries. The brief introduces the process of collecting data through Twitter's APIs and offers strategies for curating large datasets.The book gives examples of
Read More »O’Reilly Think Stats, 2nd Edition: Exploratory Data Analysis in Python
If you know how to program, you have the skills to turn data into knowledge, using tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs
Read More »Python Scripting for Spatial Data Processing
This book is a Python tutorial for beginners aiming at teaching spatial data processing. It is used as part of the courses taught in Remote Sensing and GIS at Aberystwyth University, UK.Geographic information systems belong the group of applications
Read More »Analyzing Linguistic Data: A Practical Introduction to Statistics using R
Statistical analysis is a useful skill for linguists and psycholinguists, allowing them to understand the quantitative structure of their data. This textbook provides a straightforward introduction to the statistical analysis of language. Designed fo
Read More »Introduction to Data Science, with Introduction to R
This book provides non-technical readers with a gentle introduction to essential concepts and activities of data science. For more technical readers, the book provides explanations and code for a range of interesting applications using the open sourc
Read More »Big Data on Real-World Applications
As technology advances, high volumes of valuable data are generated day by day in modern organizations. The management of such huge volumes of data has become a priority in these organizations, requiring new techniques for data management and data an
Read More »