This course prepares participants to begin business intelligence projects with a hands-on approach to gathering and cleaning data. After taking this course, participants will be ready to create their own databases or oversee the creation of databases for their company.

The focus in this course is on “Big Data” datasets containing anywhere from tens of thousands to millions of observations. While the tools used are applicable for smaller datasets of a few hundred data points, the focus is on larger datasets. The course also helps participants with no experience in building datasets to start from scratch.

Finally, the course is excellent for users of Salesforce, Tableau, Oracle, IBM, SAP and other BI software packages since it helps viewers see through the “black box” to the underlying mechanics of Business Intelligence practices.

***Please download and open all supporting materials before starting the course videos.***

Learning Objectives
  • Explore how to critically examine databases produced by Business Intelligence (BI) software packages like those from Salesforce, Tableau, Oracle, IBM for issues or concerns
  • Recognize how to gather relevant data from publicly available databases
  • Explore how to merge datasets together based on unique identifiers to create a single useable database
  • Identify and examine data via automated tools and processes to weed out questionable or erroneous data points
Last updated/reviewed: March 13, 2024

Included In Certifications

This course is included in the following Certification Programs:

15 CoursesData Analytics Professional (DAP) Certification

  1. The Fundamentals of Business Intelligence (BI): What it Does and Why it's so Essential
  2. How is Industry Using Big Data? – A Case Study Reading
  3. Business Intelligence - Data Collection and Cleaning
  4. Business Intelligence - Structuring Data for Analysis
  5. Business Intelligence - Fundamentals of Data Analysis
  6. Business Intelligence - Using Big Data Analytics
  7. Doing Data Analytics In Excel (Hands-on module)
  8. Regressions in Excel (Hands-on module)
  9. Financial Forecasting
  10. Advanced Analytics – Omitted Variables
  11. Advanced Analytics – Fixed Effects
  12. Case Study in Data Analytics
  13. Using Regressions for Forecasting
  14. Financial Forecasting in Excel (Hands-on module)
  15. Blockchain for Business
144 Reviews (670 ratings)

Reviews

5
Member's Profile
Great review of some items and nice to learn some new tips and tricks. Benford's Law was a nice nugget to discover. Sales Reps who care about good data need to watch this. Many times they do not understand how data can be skewed and they could benefit from learning how to assess the quality of the data they are looking at. Great overview for executive too.

4
Anonymous Author
This course is a good general overview of how to collect and prepare the data for analysis. The tips for scrubbing and testing the data would certainly be helpful for developing a good database. It would be great to there could be more in depth discussion of how to utilize the data from FRED after installing the Add-Ins in excel.

5
Member's Profile
This was a well presented and interesting course, and even though I've been working with data for a while I picked up a few good tips. The Excel Add in for Federal Reserve data downloads was a huge benefit in itself. Would be nice to have an optional hands on demo of data collection and cleaning to really gain intuition for it.

5
Anonymous Author
This course is helpful to learn assessing data bases, gathering data, merging data sets, cleaning databases, and pitfalls in building datasets. since collecting, cleaning, and merging data are relevant to our day to day work, it is helpful to think about a big picture to improve our work efficiency.

5
Anonymous Author
Great class!

5
Anonymous Author
Another great refresher course presented by Michael, highlighting the essentials for collecting, scrubbing, and merging data. Perhaps the most significant key learning was leveraging FRED for macroeconomic data, and configuring the add-on within Excel. I have a new tool in my arsenal!

5
Anonymous Author
I really enjoyed learning about the different ways to collect, merge and validate data. I thought is was very interesting and useful to know that there is an excel add-in where you can get data from the Federal Reserve (Fred) as well as data from the U.S. Census, and Google Trends.

4
Anonymous Author
The Course itself was great. Michael brought up a lot of good point on how data isn't perfect and users need to be careful with how they use it if they want meaningful results. Users need to be critical of the data they use both before and after analysis.

4
Member's Profile
Great course, the documents are relevant are properly explain the material covered. I would argue that some of it is outdated and should consider tools available to be used with Excel. Some of the questions in the test need to be reviewed too.

4
Anonymous Author
very informative section, I had never known of the data available at the fed site, and also Benford's law was very cool. It might be useful to also discuss adding data columns or making data more consistent in order to make modeling easier.

4
Anonymous Author
This course discussed methods on how to test that your data isn't faulty. Learning how you should test a few things to make sure you're pulling the most accurate data is helpful in a project I'm currently working on.

4
Member's Profile
The course highlights some key strategies to "clean" your data. Before this course, I did not know the importance of cleaning data. I now feel that I can take these strategies and directly improve my analysis.

5
Anonymous Author
Great course. One observation: Question 4 on the final exam was poorly worded. Choices A,B,C all said "usually the Best..." but then D was all of the above. All cannot be the best. Makes no sense.

4
Anonymous Author
I liked the real world examples he used. I also liked the pitfalls to look out for so that you do not make decisions based on a faulty dataset. Many people do not question data integrity enough.

5
Anonymous Author
Michael was excellent in his delivery on a rather complex subject, and made it easy to follow along. This helped me realize how important data collection and cleaning really is to companies.

5
Anonymous Author
I liked the organization of information in the course, and the steps provided to collect and prepare data for analysis. Recommendations were given for how yo tackle different scenarios.

5
Anonymous Author
I used to be a statistics TA, and the material is not always easy to each. However, the instructor broke down the introduction to data, sampling, and potential errors very well.

3
Anonymous Author
Questions in exam did not relate to materials. You will need to take the other modules in the certification program to get the answers to this exam questions for this course.

4
Member's Profile
This was better than the first lesson. I liked the topic that talked about limiting the extremes in a data set. Sometimes, as in life, we get too focused on the outliers.

5
Anonymous Author
The course explained the importance of learning about using appropriate data to achieve predictable results and how to obtain data that will satisfy those conditions.

5
Member's Profile
Thank you for this course. It was knowledgeable and helpful for me to learn about the process of data accessing. Learned a great deal that I did not previously know.

5
Anonymous Author
It was interesting to learn that an Excel add-in is available for accessing US Federal Reserve data. I'll be investigating this possibility for Canadian data.

5
Anonymous Author
This course was useful in learning the value a creating databases. Knowing how to check for errors and what types of questions can be answered was also useful

5
Anonymous Author
I liked learning about the FRED tool and I think this course overall offers some very sound advice on the basics of data. Would definitely recommend it.

4
Anonymous Author
Some interesting topics were insufficiently covered by this course. E.g. biased data and other ethic problems related to gathering and cleaning data.

5
Anonymous Author
Another great course. Very direct and to the point; lays out topics at the beginning and then sticks to the agenda without any extraneous information.

5
Member's Profile
I enjoy the use of steps that you need to take in order to create and/or purchase data. It's not as easy as people think. It was well covered here

4
Anonymous Author
This course served as an effective introduction to data gathering and merging techniques. I enjoyed learning about Benford's law and Winsorizing.

5
Anonymous Author
I appreciated the instructor taking the time to define words and then work them into his lesson. It increased the quality of the lesson.

4
Anonymous Author
Content is well organized and a few examples of data sources are provided. Note: Not all questions in the exam related to the content.

5
Member's Profile
Neat Class. i loved this course as I learned new technical terms and methodologies. Easy to follow presenter. Thank you, Larry

4
Anonymous Author
In an age where information is too many, this topic is very timely to assess how to ensure that information gathered is relevant.

4
Anonymous Author
This course was useful in learning the value of creating and using data bases. It helped show the importance of good clean data.

5
Member's Profile
This was a quality course related to Data Collection and cleaning 0- I thought that the information provided was very relevant.

5
Anonymous Author
Very good background! Very good breakout of the steps that need to be taken to collect, clean, merge, and analyze final data.

4
Member's Profile
Course was helpful in taking some issues in gathering data explored in a statistics class and putting it in plain English.

5
Anonymous Author
great course!!!!!!! going to show my grandkids this course one day so they can learn all this rockin information too!!!!!

4
Anonymous Author
There were some good tips in the course. I hope the next levels provide some additional detail in addition to these tips.

3
Anonymous Author
I found the theoretical information interesting; however, struggled to pick up the excel examples in the webcast format.

4
Member's Profile
The material here was presented in a clear and concise manner that made it very easy to grasp the introduced concepts.

4
Member's Profile
Well done Pretty good course on data cleaning. Provided a solid overview. Test questions could be a bit more refined.

4
Member's Profile
this is a good overview of dealing with data and some things to look for when compiling and analyzing the data sets

4
Anonymous Author
Well thought out and presented. Nice 30,000 foot view on what data is and how to work with it. No complaints here!

5
Anonymous Author
This course is helpful for beginners on assessing databases, gathering data, merging datasets, cleaning databases.

5
Anonymous Author
Great introductory course covering data gathering, merging large data sets, and data cleaning.

4
Anonymous Author
Good content as I am trying to better understand the BI process and this was a deeper dive then the prior course.

4
Anonymous Author
I enjoyed the course from an intellectual standpoint, but unclear if I'll be able to apply it to my profession.

5
Anonymous Author
This is a good basic overview of databases and collecting data. It doesn't go crazy in depth, but is an intro.

5
Member's Profile
It introduced me to statistical analysis CONCEPTS that I think I missed in my statistics classes in college…

5
Anonymous Author
This course was very useful. It was a good follow up to the Business Intelligence: What is it and why course.

1
Anonymous Author
Again. I personally cannot learn this kind of subject from a lecture. I would have to go through an example.

5
Member's Profile
Excellent course - I very much enjoyed the material and found it to be particularly insightful and rewarding.

5
Anonymous Author
Interesting course with a lot of specific examples which help to illustrate the practicality of the subject.

4
Member's Profile
Business Intelligence - Data Collection and Cleaning gives you a good grasp of gathering data and processing.

4
Member's Profile
This was helpful for the real world. I enjoyed how I learned things that could help me be better at my job.

4
Member's Profile
A lot of useful information given in this course. I will retake this course to catch what I may have missed.

5
Member's Profile
Very good course as a guidanfe and intro to Business Intelligence, i got a lot of tips vey ussefull for me.

5
Anonymous Author
Thank you so much for this great learning experience. I look forward to continueing with the CDAP program.

4
Anonymous Author
Merging and cleaning database info very helpful and never knew about Feds info add-ins that was a surprise!

4
Anonymous Author
I think the questions given were applicable to the training and although tricky I considered them useful.

5
Anonymous Author
Breaking up the information into easy to digest modules is a great way to really digest this information

5
Member's Profile
excellent insight into the steps required to address bi. excellent presentation of the course materials

5
Member's Profile
Interesting Course that I will lean on as I take additional BI courses!

4
Anonymous Author
It met all the stated objectives. I thought it was more interesting than the previous two sections.

5
Anonymous Author
A great course! I strongly recommend to anyone interested in learning more about gathering data.

5
Anonymous Author
Awesome course that touches on everything without spending too much time on any given sub-topic.

4
Anonymous Author
I felt the valid test data question was confusing, but other than that, it was a good course!

5
Member's Profile
Supporting material followed nicely with the videos. Explanations were easy to comprehend.

5
Anonymous Author
this course is very informative, i look forward to implementing in my every day practice.

5
Anonymous Author
This course helped me prepare for a training. I really feel like I got a good foundation.

5
Member's Profile
Slides were extremely helpful to review, concepts laid out very plainly but informatively

3
Anonymous Author
This was a really dry course, more relatable examples would have held my attention longer

4
Member's Profile
Pretty good presentation with some good information on the subject of BI Data Collection.

5
Member's Profile
Loved learning about Benford's law. I also did not know a stock ticker was not unique!

4
Member's Profile
this was a very engaging webcast and I think I will be able to use this going forward

5
Anonymous Author
Course was informative and easy to navigate. I would recommend it to a colleague.

5
Member's Profile
Very insightful, clearly explained and presented in a highly professional manner.

5
Anonymous Author
Great information and very informative. Great information and very informative.

4
Anonymous Author
Provided useful information on a gathering perspective of data. Nice refresher.

5
Anonymous Author
Helped lay out different tools and approaches to collecting and cleaning data.

5
Member's Profile
Informative course, real examples separate from slides help drive concepts.

5
Member's Profile
Great instructor, Learned a lot and would suggest this to a college student.

4
Anonymous Author
good course well done good job 50 characters. windsorizing is a funny name

5
Member's Profile
i thought that it was good. i don't have real thoughts, i'm just learning.

5
Anonymous Author
Good course and easy to follow. Excellent. Would recommend to a friend.

5
Member's Profile
Helpful for understanding datasets and making sure your data is not fake

5
Member's Profile
Thorough explanations and good insight into techniques for merging data

3
Anonymous Author
very good training! I gained a lot of insight in taking this course!!!!

4
Anonymous Author
I think it would have been helpful to walk through an example in excel.

4
Anonymous Author
Great course and helped teach the ways to best go about gathering data

3
Anonymous Author
The course broke down each concept into easily understandable topics.

4
Anonymous Author
Pretty high level, don't understand why this field has to be so long.

5
Anonymous Author
Good course. Wish the answers to the quiz were available afterwards.

5
Anonymous Author
This course provided great examples of data collection techniques.

5
Member's Profile
Excellent course, very interesting topic and applicable knowledge.

4
Member's Profile
this course was a great learning experience for tax professionals

5
Anonymous Author
learned ideas about excel that I did not know before, well done!

4
Member's Profile
Another good course to begin understanding Business Intelligence

5
Anonymous Author
Excellent insights into data collecting, merging, and verifying.

4
Anonymous Author
Good presentation on the basics of data gathering and merging.

4
Member's Profile
Good techniques for testing data are introduced in this course.

4
Member's Profile
Good introduction and overview of data collection and cleaning.

5
Anonymous Author
Good Training. Good tips to gather data and take good decisions

4
Anonymous Author
Good overview of some ways to gather data and how to sort it.

4
Anonymous Author
Great overview for collecting, cleaning and merging data.

3
Member's Profile
Great course with great information on how to manage data.

3
Member's Profile
great course on cleaning and gathering data. Learned a lot!

4
Member's Profile
Solid intro course - I'd be interested in the next level!

3
Member's Profile
Federal Reserve Bank data was useful information to have.

5
Member's Profile
The course made complex concepts very easy to understand.

5
Anonymous Author
Pretty interesting, especially the stuff on benfords law

5
Anonymous Author
Great course, I really enjoyed, the instructor is great

4
Member's Profile
Learned concepts on data base issues. Neat presentation

3
Member's Profile
some interesting facts here. also learned a few things

5
Member's Profile
Really good overview of data gathering and collection.

5
Anonymous Author
Everything about this course was absolutely fantastic.

5
Member's Profile
The materials were easy to follow. Nice tone of voice.

4
Member's Profile
the topics surrounding data integrity were interesting

4
Anonymous Author
The course's pacing was appropriate and informative.

5
Anonymous Author
Thank you very much for the excellent presentation!

5
Member's Profile
I liked this course. Intersting subject matter.

3
Anonymous Author
changed my mindset as to the importance of BI

5
Anonymous Author
Very interesting information, good course

4
Anonymous Author
Useful material, well structured course

5
Anonymous Author
This material knocked my socks off!!!

4
Member's Profile
Very clear, concise and informative.

5
Member's Profile
I enjoyed the powerpoint very much

5
Member's Profile
Great course! Very informative.

4
Anonymous Author
enjoyable and informative

5
Member's Profile
Informative lesson

5
Member's Profile
insightful class

5
Member's Profile
Excellent Course

5
Member's Profile
liked the course

5
Anonymous Author
Great overview

5
Anonymous Author
Great course.

4
Anonymous Author
Great Lesson

5
Member's Profile
great course

5
Anonymous Author
Good course.

5
Anonymous Author
Great video!

4
Member's Profile
Good course

5
Anonymous Author
Very clear

5
Anonymous Author
Informative

4
Member's Profile
Very good

5
Member's Profile
Great

Prerequisites
Course Complexity: Intermediate

No Advanced Preparation or Prerequisites are needed for this course. However, it is recommended to take the other courses in the series prior to completing this one.

Education Provider Information
Company: Illumeo, Inc., 75 East Santa Clara St., Suite 1215, San Jose, CA 95113
Contact: For more information regarding this course, including complaint and cancellation policies, please contact our offices at (408) 400- 3993 or send an e-mail to .
Instructor for this course
Course Syllabus
INTRODUCTION and OVERVIEW
Collecting, Cleaning, and Merging Data
  Assessing Data Bases 5:32
  Gathering Data10:08
  Merging Data Sets 7:05
  Cleaning Databases 8:47
CONCLUSION
  Pitfalls In Building Datasets 9:08
Continuous Play
  Business Intelligence – Collecting, Cleaning, and Merging Data 41:40
SUPPORTING MATERIALS
  Slides: BI Collecting, Cleaning, and Merging DataPDF
  BI Collecting, Cleaning, and Merging Data Glossary/IndexPDF
REVIEW and TEST
  REVIEW QUESTIONSquiz
 FINAL EXAMexam