Mar 16, 2018
Overall a good introductory course on Python for data science, but I feel it should have covered the basics in more detail, especially for those who do not have any prior programming background.
Dec 10, 2017
Wow, this was amazing. Learned a lot (mostly thanks to Stack Overflow) but the course also opened my eyes to all the possibilities available out there and I feel like I'm only scratching the surface!
By: Amin s
Dec 04, 2019
Terrible course. Please improve teaching efficiency and give proper, realistic assignments.
By: Adeel J
Oct 29, 2017
The instructor is not interested in teaching, just in reading out the script.
By: Sourav S
Jun 26, 2018
Horrible. Explains nothing. Johns Hopkins data science is FAR superior.
Aug 27, 2017
The video is useless; I only need to go through the code.
By: james c
May 01, 2019
Disconnected, too broad, could have been great.
Nov 02, 2018
Worst class ever! The instructor sucks!
By: Qi X
Jan 31, 2019
The autograding system is terrible.
By: Bugra S
Jan 02, 2018
Assignments are not clear.
By: Aino J
Jan 06, 2020
This was a great course and I learnt a lot! Topics covered include a quick reminder on intermediate python and lots on pandas and some numpy. The weeks 3 and 4 assignments are quite challenging so expect to spend considerably more time than indicated on the course site if you're not experienced with python and pandas. This course is not for coding newbies.
I am proficient in R for data analysis and had dabbled with python before although had no experience with pandas. I was committed to learn the course material and to spend a substantial amount of time doing so. The speed of lectures is fast. I paused often to take notes and to try out the provided notebooks, and I returned to some of the videos when working on the assignments. I found the course assignments good and challenging. The lectures give a good tour of different functions and approaches you may want to use in the assignments, but there isn't much handholding with the assignments and you'll most likely spend quite a bit of time looking things up online in pandas docs and stackoverflow. If you're used to that and generally troubleshooting code, you'll probably be just fine. I spent much more time on the assignments than what is estimated on the site: ~5h for week 2 (vs 1.5h indicated), ~1 day for week 3 (vs 2h), and 2.5 days for week 4 (vs 4h).
Week 1 gives a refresher on how to write functions, list comprehensions, and lambdas in python. If you're familiar with writing loops and functions in other languages, with this material you will get to writing them in python quickly if you invest a bit of time and effort. If you're not yet at the level of confidently writing functions, loops and vectorized alternatives in python or another language, I'd recommend starting with a different, more basic course because the learning curve with this one might be too steep.
Week 2 gives the ins and outs of pandas including creating and querying pandas series and data frames.
Week 3 covers merging data frames, grouping (groupby) with aggregation (agg), applying functions rowwise (apply), and pivoting data (pivot_table) etc. It also gives a whirlwind tour of date/time manipulation using pandas. numpy is also included.
Week 4 has some lectures on distributions and more on numpy. The week consists mainly of the main project assignment where 50% of points are given on data cleaning and munging (contents of weeks 1-3) and the other 50% of points are on modelling and hypothesis testing. It's quite a proper project in the sense that you're given a number of non-clean data files scraped from different places and a hypothesis to test. There are some additional instructions on what format of cleaned data to produce from the different files and what type of test to perform, but for the rest you're on your own.
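For readers who have not met the Week 3 operations this review lists (merging, groupby with agg, row-wise apply, pivot_table), here is a minimal sketch of what each one looks like. The data frames, column names, and values are hypothetical and invented for illustration; they are not taken from the course notebooks or assignments.

```python
import pandas as pd

# Hypothetical toy data; none of it comes from the course assignments.
sales = pd.DataFrame({
    "store": ["A", "A", "B", "B"],
    "quarter": ["Q1", "Q2", "Q1", "Q2"],
    "units": [10, 15, 7, 12],
    "price": [2.0, 2.0, 3.0, 3.0],
})
stores = pd.DataFrame({"store": ["A", "B"], "city": ["Ann Arbor", "Detroit"]})

# Merge: attach each store's city to its sales rows.
merged = sales.merge(stores, on="store", how="left")

# Apply a function row-wise (axis=1) to compute revenue per row.
merged["revenue"] = merged.apply(lambda row: row["units"] * row["price"], axis=1)

# Group by city and aggregate with agg (named aggregation).
per_city = merged.groupby("city").agg(
    total_units=("units", "sum"),
    mean_revenue=("revenue", "mean"),
)

# Pivot: one row per store, one column per quarter.
pivot = merged.pivot_table(
    index="store", columns="quarter", values="units", aggfunc="sum"
)
```

The row-wise apply is shown only because the review names it; for simple arithmetic like this, the vectorized form `merged["units"] * merged["price"]` is usually shorter and faster.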
By: LE G X
Nov 19, 2016
This course was excellent. It deviates from the many garbage MOOCs that only work with quizzes and cannot provide a real Python coding challenge. Assignments are really tough, but my sense of progress is real. (I have struggled to get that feeling in many Python MOOCs.) Using Jupyter for everything is a fantastic format (it even allows coding mobility between my station at work and my home station by coding on Jupyter in the cloud). My feedback will nevertheless point to some aspects of my experience where I think you can improve.
Succeeding in the assignment does not mean that we identified the most elegant way to apply all the knowledge of the course (lambdas, list comprehensions, grouping..., apply) in our code. Breaking that barrier is not easy for me unless we are forced to, and so my looping mindset is often applied in assignments. A real correction with the answer needs to be provided. This is what a real classroom would do: we managed to get to the answer, but we could still learn more with an assisted correction. I understand that you are worried that the model answer will end up copy-pasted on a webpage and will kill your value. You could maybe consider this as an add-on for paid customers only, and only provide it as a picture that can only be printed on paper and not so easily converted to webpage format. Or you need to find an elegant way to randomize the assignment coding test at each Coursera session, which in that case would not bring any forgery issue, and you could provide the correction at the end of the course (or after each assignment is completed).
Videos are a bit too fast on concepts sometimes.
You could split the assignment into two formats: one where simple principles of the course are first solved in a Jupyter notebook (just like the videos, but with more exercises), and a complex dataframe case as a second assignment (but please reduce the number of cases to only 1 or 2, not 3).
You could reduce the dataframe cases. (I easily spent 40 hours on assignments here; the assignment time is too heavy given my workload as a full-time scientist. This needs some careful tuning.)
Overall Great Job
By: Jeffrey D R
May 07, 2018
Like many others, I give this course a high rating while lodging a minor complaint that there wasn't much instruction provided. The lectures were excellent, if brief; it's hard to imagine anyone having objections to the instructor. But in terms of teaching the material, it was a bit of a drive-by. The lectures show a few examples, while not explaining the syntax or the various parameters. You have to draw that out of web sites and cheat sheets. If you're not adept at doing that, proceed with caution here. In the end, I was worn out from the effort, but felt that I had gained a lot.
The assignments were challenging for me because this was my first hands-on experience with Python, much less with Pandas. I did not find Stack Overflow as helpful as the instructor suggested. Nor did I find much help in the forums, but that's not quite my style.
My bottom line is that the course was time well-spent, but it could easily have been a six-week course with a more deliberate pace through the various pandas mechanisms such as merging and grouping.
FWIW: My recommendation is to get to know Jupyter Notebook early and follow along with the lectures by opening the Week[x] files in the course download folder. You can pause the lecture while you go play with the code to make sure you understand it. Also, I recommend working with a local version of Jupyter and keep your files local. Otherwise, Jupyter loses connection to the kernel, and stops being able to save your work. The messages are disconcerting, and if you've worked yourself into a frenzy, they can cause panic and confusion. So do all the work on your machine and then upload the whole assignment when you are finished. You upload on the "Create a Submission" screen; it takes only a sec. You won't even have to worry about details like file paths; they'll be the same either way. Once you get the hang of Jupyter, you can settle into a work routine. Learn some of the keyboard shortcuts.
By: Zhenxun Z
Jan 12, 2017
I really like Prof. Brooks's way of teaching. He developed a very good introductory level course. Apart from some talks about data science as a whole, he concentrated on the preparatory work in this field -- data cleaning. Instead of delving into theories, he paid most of his attention to how to make things work by using python. I actually have a background in C, and I was a bit reluctant to learn python at first since C is already strong enough to attack most tasks. However, I have fallen in love with python now, and I think it is a much more suitable language for daily use especially when your projects aren't very large. Among its many merits, the best thing about python is of course its numerous libraries like numpy and pandas which free us from tedious low-level programming. I am quite convinced that I will move to python from now on.
In addition to lectures, I truly recommend you go over extra reading materials. Those articles are very thought provoking. For example, the first one "50 Years of Data Science" totally changed my previous view towards this field. It made me realize that data science is not a simple combination of statistics and machine learning, that it is a distinct way of obtaining new knowledge, and that its advancement shall benefit the whole science society.
About the assignments, what is taught in the lectures is not enough, and you should refer to the Python documentation and Stack Overflow. I think knowing how to solve problems and where to find help is more important than solving the problems themselves, and that's why I consider those assignments well designed.
Finally, thanks to all the efforts made by the teaching staff.
By: Shawn T R
Jul 12, 2018
Overall a great course which really pushed me to improve my Python skills and get more comfortable with pandas, which is really powerful for data analysis work. It also showed me how awesome Jupyter notebooks are to use. I'll be using them in all of my Python courses moving forward, whether or not the course requires it. I will say though that the estimates for the amount of time the courses will take per week are way too low. This is a problem I've encountered on every MOOC platform I've ever used though. They really just want to get you in, and saying that you'll be spending 15 hours per week on a course will scare many people away. I've easily spent more than that for some weeks in this course. In the end though, I didn't feel that my time was wasted. The assignments are challenging and really force you to get better at Python if you want to try to solve them on your own and not immediately resort to the forums. I'm probably just a bit of a masochist that way, and it honestly may have doubled the amount of time it took to finish the course, but I find trying to solve the problems with as little guidance as possible very rewarding. You just become a better coder overall. However! If time is a major concern and masochism isn't your thing, I highly recommend just giving it a go for only an hour or so if you're stuck and then going to the Discussion Forums. There are very useful posts there from the teaching assistants that will show you the most efficient ways of solving the problems the "Pandorable" way and save you gobs of time. TL;DR = Loved the course and would highly recommend it :-)
By: Victor M S D
Jan 30, 2017
Very nice course. You will learn how to use the pandas library for Python effectively and how to handle DataFrames in that environment; there are nice functions and methods for parsing. The course is very fast-paced. I only have time on the weekends (some of them), so I had to switch dates twice. Also, some materials go very fast, so if you are new to Python, be sure you have mastered the earlier concepts of the course (Week 2 depends on Week 1, and so on ...). A large part of the course involves your own research in the Python docs and on Stack Overflow. As I am an R user, some things are intuitive (and maybe easier for me to do in R). For several things I thought, "Wow, but this is so much easier in R", but at other times I saw the power of Python for parsing tasks or web data that in R require too much memory or are more complicated to obtain. R has the problem of treating everything as an object and being guided by vectors, but in some ways that makes me understand coding details more deeply, at least for me. I still think that the documentation of R is the best there is for my purposes, so I will stick with that, but it's great to know how to develop some things in Python, mainly because of my goal of getting some applications to end users. Finally, not much related to the course, but maybe with the change in the Coursera platform, the forums seemed a little more confusing and a little slower than in former courses that I took. I think it might have to do with the same course running in parallel on different dates.
By: Jakob B
Nov 19, 2018
Very valuable course for the fearless and stress-resistant. The scope and challenge level of the assignments is excellent. When you finally get the code right, you feel like you can solve any puzzle, because you can only get it right, when you completely understood why it's not giving the correct answer and you get those "Ohhh, that's what's wrong" moments all the time. So there is zero chance, that you will come out of this course having no idea what you actually learned. (Sadly a prevalent feeling at universities nowadays)
Apart from that, there is zero handholding and you MUST understand basic programming concepts. When even the course masters specifically tell you, that you will have to "google around" for that one vital code snippet, they mean it. This irritated me at first, because I expected a course on something to be self-contained (except the pre-conditional knowledge of stats and programming). But after completing it, I realized, that a big part of the professional programmer's workflow indeed does consist of googling for an answer on stackoverflow, since one and the same problem can be solved using a myriad of ways.
Big props to the creators of the assignments. Those were fire!
Best regards from Germany.
By: Vinod G
Nov 19, 2019
Excellent material. Admittedly I can see why there are so many negative reviews about the ambiguity of the assessed tasks. It won't be an easy course for anyone who is unfamiliar with programming. However, if you do have programming experience under your belt, you'll likely find this course strikes an excellent balance in terms of conciseness, practice, and theory. Each lecture is crafted carefully to teach you about some nuance of pandas or numpy, and the programming assignments are packed with coding questions that will help you revise what you have learned, in a very efficient way. There is very little "fluff" in this course; fluff is a major weakness I've seen in similar courses. Too much spoon feeding often does not challenge or engage the learner. The course is very direct about what it expects of its students. Every week there is a comment "This week's assignment requires more self-learning than the last". And true to its word, there is less and less hand-holding as you go further into the course. I thoroughly enjoyed the material and probably learned more from this course than from any other course I've taken on Coursera, taking into account its length.
By: angelo p
Sep 18, 2017
I am a practicing engineer with over 15 years of experience. This course was definitely not for the faint of heart. While the lectures themselves were crisp and easy to follow, the homework assignments were truly a test of your patience and ability to think through the fundamentals. In all 4 weeks, I took way more than the predicted time to complete the assignments. I still gave the course a 5 because of what it taught me. I would never have learned so much had the assignments just been a rehash of the lecture examples. I applied the material to real world data analysis in my field with very good results. I could not have done the computations I did, if I had not taken the course seriously. There could be more clarity in the assignments especially in week 3 and week 4 but then that's the fun too - in real world data, there is no instructor to watch your back. So mistakes made during the course, while frustrating for a working professional with limited time, translate to better outcomes in actual work. I must also acknowledge Sophie Greene for her efforts to guide students to think through the problem statements.
By: Julien M
Dec 09, 2016
This is a *very* good Python Pandas course that's part of a data science specialization that I wish existed 3-4 years ago when I first started using Pandas. As much as I like Wes McKinney's book "Python for Data Analysis", having an e-learning format mixing video lectures and hands-on assignments is a definite plus.
I decided to take this class out of curiosity (and in order to get ideas about a class I'm about to teach), and even though I consider myself an experienced pandas user (top 5% on Stack Overflow for [pandas]), I think it's pretty involved if you're new to pandas and covers pretty much every key concept of pandas you should (must?) be aware of. The assignments can certainly look very challenging to the newcomer, and they do a great job of looking like an actual project: messy data, multiple sources, etc.
I heartily recommend this course (and I have already started to personally recommend it) to anyone who's interested in learning more about data wrangling in pandas!
Thanks for putting this together!
By: Eshaan K
Feb 02, 2017
I certainly learnt a lot through this course. The hands on oriented approach of this course works wonders for those who intend to dig deeper and improve their Python programming. The lectures are crisp and clear, but the main learning happens while doing the weekly assignments. Each assignment forces us to go search for content beyond what has been taught in the lectures and in this process we find new things. Also, the lecturer informed us about some good reference books for individual learning. I look forward to taking part two of this course.
The mentors in the discussions forum are quite active and helpful. The forum was of big help to me for clearing the assignments. The only flaw I found was that the autograder does not clearly define our mistakes and that's where the mentors come in and explain in the discussion forum. This part could be better automated.
Finally, I believe that a prior introduction to Python programming is an essential prerequisite to this course.
By: Jeff C
Feb 28, 2017
The course was a very efficient intro to using Python to get data from various simple sources and the different structures that can be used to hold and manipulate the data. The course starts with the intuitive methods to solve problems and then introduces more complex and process friendly alternatives to demonstrate how creativity and knowledge of Python can be used to generate elegant and efficient code. The auto-grading of assignments can be a little maddening as the correct answer needs to be formatted precisely as is expected or else it will be counted as incorrect. The good thing here is that it forces you to fully understand the data types and data, the bad is it can be frustrating not knowing if the answer is wrong or just the format of the answer. Admittedly, I feel this effort forced me to become a better Python programmer.
By: Yusuf E
Mar 03, 2018
The assignments in this course are extremely challenging and almost unsolvable if not for the forum/Stack Overflow help. Among the nine certificates I got on Coursera, this has to be the most difficult to obtain. It even surpasses Getting and Cleaning Data or R Programming in the Johns Hopkins Data Science specialization. Also, some more time on lambdas would really be appreciated given how important they seem to be in pandas.
Having said that, it was a real pleasure going through the assignments, repeatedly debugging them and finally getting to the answer. I have no complaints about the structure of the course or the contents. Given how easy the Python for Everybody specialization was, this course was really refreshing. I hope for more of the same for the rest of the specialization.
By: Rajendra K
May 09, 2017
An excellent course, which requires more self-learning than what is taught. I extended the course and completed the final assignment in a week. The assignments involving data cleaning actually help a budding data scientist. Using the documentation and Stack Overflow for assignments is a must, and it can boost your understanding not just for this course but for a lot more. My personal satisfaction with this course comes mainly from the Stack Overflow part, which helped me understand the lectures weeks ahead. But applying the concepts in the assignments is difficult and can only be mastered with practice. Thanks to all mentors, especially "Sophie Greene", whom I followed a lot. Her debugging, algorithms, and code checks have actually helped me understand both Python and pandas better.
Dec 13, 2017
The aim of this course is to help us get familiar with pandas. If you are already quite familiar with the functions in pandas, this course could be too simple. However, as someone who is new to pandas, I had to spend quite some time getting familiar with those functions so that I could finish the assignments.
This course really helps, I think. As I moved through the course, I found myself becoming more familiar with the design logic of pandas and gradually working more fluently on the assignments.
Besides, the mentor in the forum is very helpful and has provided lots of hints to help us move forward. I'm really grateful for that, but I still suggest that new students work hard on their own before skimming the forum, because some hints cross the line.
By: MEILIN Z
Feb 05, 2017
This course taught me pandas and hypothesis testing, which are very useful in my studies and future work. For example, I learned how to sort values, merge tables, and reset indexes. Most of it is like SQL.
The lectures give some basic knowledge of the contents for each week. The course also provides enough related materials, including the code and slides from the classes. It also has subtitles, which helped my understanding a lot.
When I was doing the assignments, I still needed to search for more information on the internet and learn by myself. I think this is a great way to learn how to solve problems on my own. I think it is also a little challenging for a beginner, so I suggest that beginners learn some basic knowledge before taking this course.
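As a rough illustration of the SQL comparison in this review, the three operations it names (merging tables, sorting values, resetting indexes) can be sketched as below. The tables and column names are hypothetical, and the SQL in the comments is only an approximate analogy, not an exact equivalence.

```python
import pandas as pd

# Hypothetical tables, invented for illustration.
students = pd.DataFrame({"student_id": [1, 2, 3], "name": ["Ana", "Ben", "Chen"]})
scores = pd.DataFrame({"student_id": [1, 2, 3], "score": [88, 95, 72]})

# Roughly: SELECT ... FROM students JOIN scores USING (student_id)
joined = students.merge(scores, on="student_id", how="inner")

# Roughly: ORDER BY score DESC
ranked = joined.sort_values("score", ascending=False)

# After sorting, the old row labels are out of order; reset_index(drop=True)
# replaces them with a fresh 0..n-1 index and discards the old labels.
ranked = ranked.reset_index(drop=True)
```

One difference from SQL worth noting: a pandas index travels with the rows, which is why a reset step is often needed after sorting or filtering.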
By: Gabrielle S
Sep 18, 2017
I'm very happy with the course overall, especially the challenges that the graded assignments offer. The lecture covers just enough detail to give you a broad understanding of the topic, but allows room for self-discovery, as in having to read the docs to accomplish your assignments. I'm happy with the quality of instruction and level of knowledge that the lectures have as well. The main instructor was very articulate and demonstrated a deep knowledge and a lot of experience with Python pandas, as well as statistics. The discussion forums have been extremely helpful throughout completing assignments, and got me moving from where I was stuck. I've certainly leveled up my python and pandas (especially pandas) skills from taking this course.