Raw Data
jeromyanglim edited this page Oct 19, 2014
·
3 revisions
The raw-data folder is for processing data that still needs substantial cleaning.
The rationale is that files placed in the data folder should be reasonably close to what will actually be used.
It is also useful when major data processing is time consuming and only needs to be done occasionally (for example, when new raw data arrives) or almost never.
Examples of when this is needed include:
* The name of the data.frame is wrong
* Variable names are not as desired
* Substantial manual overriding and modification is required
* Identifying information or other material needs to be removed
* The source system provides problematic data (a common reason for needing this step)

The assumption is that this code does not need to be run every time.
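
As a rough illustration, a one-off cleaning script for the raw-data folder might look like the sketch below. It uses base R only. Everything named here is hypothetical and not part of the template: the `raw_export` data, its column names, the `clean_raw` helper, the `survey` object name, and the output location (a temporary directory stands in for the data folder).

```r
# Hypothetical raw export. In practice this would be read from a file in the
# raw-data folder (for example with read.csv()); the columns are made up.
raw_export <- data.frame(
  Q1.Resp.Name = c("Ann Lee", "Bo Chan"),
  Q2.Age.Yrs   = c(34, 51),
  Q3.Score     = c(7, 9),
  stringsAsFactors = FALSE
)

# Hypothetical mapping from raw column names to the desired variable names.
rename_map <- c(
  Q1.Resp.Name = "name",
  Q2.Age.Yrs   = "age",
  Q3.Score     = "score"
)

clean_raw <- function(x) {
  # Variable names are not as desired: rename using the mapping.
  names(x) <- unname(rename_map[names(x)])
  # Identifying information needs to be removed: drop the name column
  # and add an arbitrary row id in its place.
  x$name <- NULL
  x$id <- seq_len(nrow(x))
  x[, c("id", "age", "score")]
}

# Name of the data.frame is wrong: store the result under the desired name.
survey <- clean_raw(raw_export)

# Save the cleaned file where the main project would pick it up.
# out_dir is an assumption; point it at the project's data folder instead.
out_dir <- tempdir()
write.csv(survey, file.path(out_dir, "survey.csv"), row.names = FALSE)
```

A script like this would be run by hand when new raw data arrives, rather than every time the project is loaded.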