I learned a lot from the class meeting overview, in which Professor Dennis explained the three Vs of big data plus a fourth (velocity, volume, variety, and veracity). The essential distinction between traditional data and big data is that traditional data uses a centralized database architecture, while big data uses a distributed architecture. Traditional databases gather and populate information, then calculate and report based on a predefined set of questions, queries, and data schemas. Big data architecture, on the other hand, is based on on-the-fly calculations over preexisting data sets: the schemas and variables are created later, based on the questions being asked. Choosing between traditional data and big data therefore depends on the purpose of the work or project, and it does not mean that one is better than the other. There are pros and cons to each framework. For example, big data can give us a more detailed analysis of markets and business models, and better forecasting, while the analysis is more limited under a traditional data architecture. Conversely, traditional data offers more security and is easier to use, while security is a massive issue in big data because of the on-the-fly data manipulations and calculations.
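To make the schema difference above more concrete, here is a minimal Python sketch. It is only an illustration under my own assumptions: the sample records, the `sales` table, and both function names are hypothetical, and the in-memory SQLite database merely stands in for a traditional centralized database. It is not a real distributed big data system.

```python
import json
import sqlite3

# Hypothetical sample records, invented for illustration only.
RAW_EVENTS = [
    '{"customer": "a", "amount": 10.0, "channel": "web"}',
    '{"customer": "b", "amount": 5.5}',
    '{"customer": "a", "amount": 2.5, "channel": "mobile", "coupon": "X1"}',
]


def total_by_customer_predefined_schema(raw_events):
    """Traditional style: fix the schema first, then load and query."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales (customer TEXT NOT NULL, amount REAL NOT NULL)"
    )
    for line in raw_events:
        rec = json.loads(line)
        # Fields outside the predefined schema (channel, coupon) are
        # dropped at load time, so later questions cannot use them.
        conn.execute(
            "INSERT INTO sales VALUES (?, ?)", (rec["customer"], rec["amount"])
        )
    rows = conn.execute(
        "SELECT customer, SUM(amount) FROM sales "
        "GROUP BY customer ORDER BY customer"
    ).fetchall()
    conn.close()
    return dict(rows)


def count_by_field_at_read_time(raw_events, field):
    """Big data style: keep raw records, pick the field when the question is asked."""
    counts = {}
    for line in raw_events:
        rec = json.loads(line)
        # Records that lack the field are grouped under "unknown".
        key = rec.get(field, "unknown")
        counts[key] = counts.get(key, 0) + 1
    return counts
```

In the first function, the question (total amount per customer) had to be known before the data was loaded. In the second, a new question such as "how many sales per channel?" can be asked later by calling `count_by_field_at_read_time(RAW_EVENTS, "channel")`, because the raw records were kept as they arrived.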
However, what is new in big data, or what makes big data new?
From my point of view, the digital age we now live in is the fundamental reason for the move toward big data and for big data being seen as something new. Traditional data cannot answer many of the problems, forecasts, and evaluations of the new technical and digital economy. As I mentioned above, this does not mean that big data is a good alternative for all purposes. It has many problems that still need to be solved, including scaling: "increasing data throughput, growing amounts of data, and streamlining technical systems to facilitate data processing." (Lugmayr et al., 2017)
"Identify and discuss at least 2 public sites that provide free access to big data sets." (CTU, 2022)
AWS cloud is one of the most sophisticated, easy-to-use, and secure platforms in the big data market, and it has a free tier available. I currently use both the free and paid AWS cloud systems at our company. AWS cloud is very cost-effective, runs high-performance queries on petabytes of both structured and unstructured data, and can generate compelling reports.
Microsoft is another prominent data service provider, with high security and high-performance software and hardware infrastructure. Microsoft also has a free tier, which I have not tried yet.
References
CTU. (2022). Colorado Technical University. Student's restricted panel. Retrieved 2022, from Colorado Technical University restricted area of assignments.
Lugmayr, A., Stockleben, B., Scheib, C., & Mailaparampil, M. A. (2017). Cognitive big data: Survey and review on big data research and its implications. What is really "new" in big data? Journal of Knowledge Management, 21(1), 197-212. https://doi.org/10.1108/JKM-07-2016-0307