Modern Big Data Processing with Hadoop

Data acquisition

Although it is not strictly a part of data wrangling, this phase covers the process of acquiring data from its source. Typically, the data is either generated and stored in a central location or made available as files on some shared storage.

Understanding this step helps us build an interface, or use existing libraries, to pull data from the source location.
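As a rough illustration of such an interface, the sketch below pulls records from files sitting in a shared directory. It assumes the files are UTF-8 CSV files with a header row; the function name `acquire_records`, the `_source_file` field, and the example path are hypothetical and chosen only for this example. Only the Python standard library is used.

```python
import csv
from pathlib import Path
from typing import Dict, Iterator


def acquire_records(source_dir: str, pattern: str = "*.csv") -> Iterator[Dict[str, str]]:
    """Yield one dict per row from every matching file in source_dir.

    Assumption: each file is a UTF-8 CSV file with a header row.
    """
    # Sort the paths so files are read in a predictable order.
    for path in sorted(Path(source_dir).glob(pattern)):
        with path.open(newline="", encoding="utf-8") as handle:
            for row in csv.DictReader(handle):
                # Hypothetical extra field: remember which file the row came from.
                row["_source_file"] = path.name
                yield row


if __name__ == "__main__":
    # "/mnt/shared/exports" is a placeholder for a shared-storage mount point.
    for record in acquire_records("/mnt/shared/exports"):
        print(record)
```

Because the function is a generator, rows are read lazily, so a later wrangling step can consume them one at a time without loading every file into memory first.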