Last visit was: 22 Apr 2026, 03:22 It is currently 22 Apr 2026, 03:22
Close
GMAT Club Daily Prep
Thank you for using the timer - this advanced tool can estimate your performance and suggest more practice questions. We have subscribed you to Daily Prep Questions via email.

Customized
for You

we will pick new questions that match your level based on your Timer History

Track
Your Progress

every week, we’ll send you an estimated GMAT score based on your performance

Practice
Pays

we will pick new questions that match your level based on your Timer History
Not interested in getting valuable practice questions and articles delivered to your email? No problem, unsubscribe here.
Close
Request Expert Reply
Confirm Cancel
User avatar
sghaneka
Joined: 06 Dec 2004
Last visit: 21 Feb 2011
Posts: 2
Own Kudos:
1
 [1]
Posts: 2
Kudos: 1
 [1]
1
Kudos
Add Kudos
Bookmarks
Bookmark this Post
User avatar
beyondgmatscore
Joined: 14 Feb 2011
Last visit: 10 Nov 2015
Posts: 101
Own Kudos:
Given Kudos: 3
Posts: 101
Kudos: 442
Kudos
Add Kudos
Bookmarks
Bookmark this Post
User avatar
fluke
User avatar
Retired Moderator
Joined: 20 Dec 2010
Last visit: 24 Oct 2013
Posts: 1,095
Own Kudos:
Given Kudos: 376
Posts: 1,095
Kudos: 5,167
Kudos
Add Kudos
Bookmarks
Bookmark this Post
User avatar
beyondgmatscore
Joined: 14 Feb 2011
Last visit: 10 Nov 2015
Posts: 101
Own Kudos:
Given Kudos: 3
Posts: 101
Kudos: 442
Kudos
Add Kudos
Bookmarks
Bookmark this Post
I think the question is phrased ambiguously. The data set has 1000 records and 50 variables - its unclear whether there are 1000 records for each of the 50 variables or that each variable has certain number of records and the total number of records in 1000.
User avatar
fluke
User avatar
Retired Moderator
Joined: 20 Dec 2010
Last visit: 24 Oct 2013
Posts: 1,095
Own Kudos:
Given Kudos: 376
Posts: 1,095
Kudos: 5,167
Kudos
Add Kudos
Bookmarks
Bookmark this Post
A rephrasing of the question would be:

A table has 1000 rows and 50 columns with 5% of the values missing, spread randomly throughout the rows and columns. An analyst decides to remove rows that have missing values. About how many rows would you expect would be removed?
User avatar
beyondgmatscore
Joined: 14 Feb 2011
Last visit: 10 Nov 2015
Posts: 101
Own Kudos:
Given Kudos: 3
Posts: 101
Kudos: 442
Kudos
Add Kudos
Bookmarks
Bookmark this Post
Above rephrasing clarifies it better, but it would still leave the question incomplete, as in those 2500 values can be distributed across those 1000 rows in any number of ways.
User avatar
IanStewart
User avatar
GMAT Tutor
Joined: 24 Jun 2008
Last visit: 17 Apr 2026
Posts: 4,143
Own Kudos:
Given Kudos: 99
Expert
Expert reply
Posts: 4,143
Kudos: 11,268
Kudos
Add Kudos
Bookmarks
Bookmark this Post
This question is far, far beyond the scope of the GMAT. If you care about how to solve it (and if you're just studying for the GMAT, believe me, you don't care!) then you'd expect there to be on average 2.5 missing values per row. The probability of a missing value is .05, so using binomial probability, the standard deviation of the number of missing values per row is root(50*0.05*0.95) = 1.54 (approximately). You'd then need to consult a normal distribution stats table to figure out what percentage of rows you'd expect to have 0 incomplete entries (what percentage would be more than 2.5/1.54 = 1.6 standard deviations below the expected value). You don't have stats tables at hand during the GMAT, so you can't possibly answer this question on the GMAT, and the knowledge required is way beyond the test regardless.

As general advice, it's not a good idea to use, say, undergraduate probability or stats textbooks for GMAT preparation. You'll waste a lot of your time studying material that is entirely irrelevant to the test. Anyone preparing for the GMAT should ignore this thread.
User avatar
beyondgmatscore
Joined: 14 Feb 2011
Last visit: 10 Nov 2015
Posts: 101
Own Kudos:
Given Kudos: 3
Posts: 101
Kudos: 442
Kudos
Add Kudos
Bookmarks
Bookmark this Post
Ian - what you are saying would be true if we can assume these missing values are normally distributed - we don't have any such information in the question, so cant really assume a normal distribution - or can we?
User avatar
IanStewart
User avatar
GMAT Tutor
Joined: 24 Jun 2008
Last visit: 17 Apr 2026
Posts: 4,143
Own Kudos:
Given Kudos: 99
Expert
Expert reply
Posts: 4,143
Kudos: 11,268
Kudos
Add Kudos
Bookmarks
Bookmark this Post
beyondgmatscore
Ian - what you are saying would be true if we can assume these missing values are normally distributed - we don't have any such information in the question, so cant really assume a normal distribution - or can we?

This is all completely irrelevant for the GMAT, but for interest only: say you flip a coin 1000 times, and write down how many Heads you get. Then you repeat that experiment a million times, writing down a list of the numbers of Heads you get each time. Of course, on average you'll have 500 Heads, but the number of Heads in your list will be binomially (approximately normally) distributed. That's precisely where the normal distribution comes from; the normal distribution describes the distribution of the number of Heads you'll get when you flip a coin an infinite number of times. Whenever you have a sequence of independent events where each event has some fixed probability of 'success', you have a binomial distribution. So I wasn't making any assumptions (technically I was ignoring the slight dependence built into the question, but that shouldn't materially affect the answer). In any case, the normal distribution is *not* tested on the GMAT, so you don't need to know any of this.
User avatar
sghaneka
Joined: 06 Dec 2004
Last visit: 21 Feb 2011
Posts: 2
Own Kudos:
Posts: 2
Kudos: 1
Kudos
Add Kudos
Bookmarks
Bookmark this Post
Thanks so much for you advice. I was just curious looking at the problem.

Posted from my mobile device
User avatar
bumpbot
User avatar
Non-Human User
Joined: 09 Sep 2013
Last visit: 04 Jan 2021
Posts: 38,957
Own Kudos:
Posts: 38,957
Kudos: 1,117
Kudos
Add Kudos
Bookmarks
Bookmark this Post
Automated notice from GMAT Club BumpBot:

A member just gave Kudos to this thread, showing it’s still useful. I’ve bumped it to the top so more people can benefit. Feel free to add your own questions or solutions.

This post was generated automatically.
Moderator:
Math Expert
109740 posts