What goes into designing and configuring a Data Lake? How does Hadoop figure in? Is security a concern? Our own Mike Vogel, Big Data Architect, answers these questions and more in this video Q&A. Continue reading A Plunge Into the Data Lake: Q&A with Mike Vogel
Our client was experiencing problems with managing their data assets. Data ingestion and processing were not managed in an organized way, causing users to waste time and effort when trying to find data. The large volume and variety of structured and unstructured data were only expected to increase as the organization grew. IT was struggling to balance the requirements of users with building a scalable, solid data foundation.
To meet their growing data needs, our client wanted fast, searchable, on-demand access to data and a way to conduct analytics and visualization on the data that users selected.
If you’ve ever wanted to know how a Data Lake works, look no further than this month’s infographic.
Proponents of the Data Lake are quick to point out the many potential benefits. For example, data will be readily available for end-user consumption, or users will have access to the “full historical view” of data and will be able to make innovative connections between different types of data (such as relational data, documents, or images). Continue reading The Data Lake: Is It All About the Metadata?
Editor’s Note: This post from Alex Beals, rockstar big data intern at Knowledgent, shares his experiences and the important lessons he learned during his time here.
As the last summer before college, and my freshman year at Dartmouth College, comes to a close, it’s the perfect time to think about the wonderful opportunity I had to work at Knowledgent as an intern. This was the first time I was able to get real-world experience in the field of computer science (what I’m planning on majoring in), and it allowed me to hone my skills while being able to contribute on a team and further the company. Continue reading Contributing in a Real-World Environment: Three Takeaways from My Internship
Depending on your perspective and what you are trying to achieve, “quality” takes on different meanings. That being said, most people would probably agree that data quality relates to some degree of “correctness”. However, data can be correct in different ways.
So, what is “quality”? Many might say that quality is having the correct information, but that is only part of the data quality story. I like to think of quality as a three-legged stool, where each “leg” or characteristic is critical in supporting the overall structure.
What’s in a name? Everything.
A name is an idea. It carries with it many dimensions of meaning that shape understanding. So why do we continue to speak in the clumsy and imprecise parlance of the day, embracing such emergent monikers as “big data“? (Wasn’t data always big? Isn’t it the economics of using that data that have favorably changed?) Simply put, the trends are racing in so many directions, changing the data landscape so quickly, that in order to keep up, we must employ the language being used, lest we fall short of the productive connections we seek to make. Continue reading What’s in a name? DAaaS is more than semantics.
Editor’s Note: This post from Amanda Sullivan, rockstar intern at Knowledgent, shares her experiences and the important lessons she learned during her time here.
I started my internship with Knowledgent following my freshman year at Rowan University. With my studies focusing on Business Administration and Communications, this summer I was given the opportunity to experience the corporate field first hand. I wanted exposure to the industry I plan on pursuing later in life.
OODT (Object Oriented Data Technology) is an open-source, data management framework currently distributed by Apache. Originally from NASA’s Jet Propulsion Laboratory, this component-oriented software was developed to focus on access to science data repositories and data generation and capture.
We sat down with Arpan Bhattacharya, Big Data Engineer here at Knowledgent, to pick his brain on the advantages of using OODT: Continue reading Q&A: Arpan Bhattacharya on the Advantages of OODT