HomeНаука и техникаRelated VideosMore From: GOTO Conferences

GOTO 2012 • The R Language The Good The Bad & The Ugly • John Cook

787 ratings | 144988 views
This presentation was recorded at GOTO Aarhus 2012 http://gotocon.com John Cook - Research Statistician at M. D. Anderson Cancer Center ABSTRACT R is a domain-specific language for analyzing data. Why does data analysis need its own DSL? What does R do well and what does it do poorly? How can developers take advantage of R's strengths and mitigate its weaknesses? This talk will give some answers to these questions. https://twitter.com/gotoamst https://www.facebook.com/GOTOConference http://gotocon.com
Html code for embedding videos on your blog
Text Comments (43)
Theodore (1 month ago)
Nice examples, and well explained.
Marie G Jean (2 months ago)
U
Eee Eee (4 months ago)
Sowsossow
Akshay Singh (6 months ago)
Oh my god we are missing data.. what are we gonna do throw an exception call the police...:)
Samsul Bahri (3 months ago)
Akshay Singh dyj
Technogeekscs Pune (6 months ago)
Learn r programming from our experts who are already a working professional in R Programming. www.technogeekscs.com
Russ Kee (1 year ago)
2012 , this is old. The latest R language is super good with all the libraries
km2052 (2 years ago)
thx
Oleksandr Fialko (2 years ago)
"The alternative to R isn't Python". Well, Python is alternative to everything.
Jerry Lee (2 years ago)
Do you need a background in statistics to use R?
ein mensch (1 year ago)
You don't need it, you'll get it when writing R code.
spArKly_spArKly (2 years ago)
No - not at all. No more than any other programming language. However R excels at statistics.
Derek Murawsky (2 years ago)
Emacs webserver? Yeah, that exists... lol https://github.com/eschulte/emacs-web-server
meathook3000 (6 months ago)
Why?
Tan Pham Van (2 years ago)
BEST R language BOOKS EVER The Art of R Programming: A Tour of Statistical Software Design http://amzn.to/22pmUn7 R Language: for Absolute Beginners http://amzn.to/22pmSM6 R Cookbook (O'Reilly Cookbooks) http://amzn.to/1PiaiBp R in Action: Data Analysis and Graphics with R http://amzn.to/1UbNpYL An Introduction to Statistical Learning: with Applications in R (Springer Texts in Statistics) http://amzn.to/21Akix9
Tpv tpv (2 years ago)
BEST R language BOOKS EVER The Art of R Programming: A Tour of Statistical Software Design http://amzn.to/22pmUn7 R Language: for Absolute Beginners http://amzn.to/22pmSM6 R Cookbook (O'Reilly Cookbooks) http://amzn.to/1PiaiBp R in Action: Data Analysis and Graphics with R http://amzn.to/1UbNpYL An Introduction to Statistical Learning: with Applications in R (Springer Texts in Statistics) http://amzn.to/21Akix9
You could avoid the waste of memory by typing : x<-sum(rnorm(1e6)) at 26:18
cepi24 (2 years ago)
can you recommend good books to learning statistics with R?
Arun Rangarajan (2 years ago)
"Learning Statistics with R" by Daniel Navarro is an excellent introduction. Free legal copy is here for download: https://health.adelaide.edu.au/psychology/ccs/teaching/lsr/ The hard copy is available for purchase for a nominal price of US $30 from lulu.com and I have it. It is a huge unwieldy book, but otherwise I have no complaints.
2freet (2 years ago)
+cepi24 I'd recommend online courses. Have a look at datacamp.com.
Gerard (2 years ago)
Did anyone else catch the s-- joke at around @3:22
Semiha Şamiloğlu (2 months ago)
+Bart S mk
Orpheus (2 years ago)
+Gerard Jitec yeah hahaha
Bart S (2 years ago)
+Gerard Jitec Yes :) funny though.
quartzimaging (2 years ago)
Dude starts off saying people use R for statistical research but not for analysis. Three minutes later he shows data that says R is the by far most popular analysis tool. wtf?
Finan (2 years ago)
i disagree with the speaker... Python is an alternative R! Please have a look at the libraries numpy, pandas and matplotlib!
meathook3000 (6 months ago)
One of his points was that R works out of the box for doing statistics analysis and research, without having to install extra packages.
Shrinidhi Rao (2 years ago)
+Finan It is NOW. Not the case for many years.
avro549B (2 years ago)
When discussing the efficiency of programming languages, especially those for use as experimental tools, human time is multiple orders of magnitude more important than machine time. Bit-nanoseconds cost infinitesimal amounts, human time is not infrequently expensive in itself, and subject to multiplication by the cost of project delay. Worrying about computer "efficiency" is mostly a legacy of 1960s thinking, when computers were expensive and people cheap.
avro549B (2 years ago)
Any programming language, no matter how specialised or domain-specific, will be used to write a least one application for which it is utterly unsuitable. Sometimes, this is because the author doesn't know any better. Other times, the motivation is sheer programmer perversity.
Mike Clark (3 years ago)
This is a very nice overview of R for the beginner. I won't pick apart the language "gaffes" when the speaker defaults to more common programming-speak (e.g., use of term "modules"), as an audience familiar with other languages is more likely to understand those terms as opposed to the quirky R terminology, and yes, explicit loops in ANY interpreted language usually are the devil. However, as someone who has programmed in R and SAS for 10+ years now, I do take issue with the characterization of SAS as being on par with Windows Command Prompt in terms of data management functionality. That's extremely inaccurate, but I'll give him a pass on that one, as he admits to having no prior experience with SAS.
David Winsemius (3 years ago)
This guy doesn't know R very well. Assignment of a data-object to a function name does not mask the function. For-loops are not inherently inefficient (this was fixed about twenty versions ago). You should not use attach(). R uses the term "package", not "module". The "parallel" package was built into the base distribution several versions ago (which does use parts of R). He was way behind the available books at the time this talk was given. And... how can you give a talk on R and mangle Prof Brian Ripley's name?
James Levy (1 month ago)
MrGoatflakes m
Theodore (1 month ago)
People are still converting for-loops to vectors for better efficiency in 'R' : https://youtu.be/gaJorAU644o?t=14m10s
MrGoatflakes (3 years ago)
+David Winsemius hmm fancy that. I have done R part time for about 5 weeks and already I know that the function namespace is separate from the data namespace. I just don't know _why_ this is true (convenience?). Seems to clash with the "functions are first class objects" principle and the neat syntax R has for functions which implies this. That is, if you have a function you want to name, you do something like: myfunction <- function (x) mean (x$foo) which is syntactically similar (and is far as I know the same) as assigning to a data object: x <- 3.7 whereas if you want an anonymous function you do something like lapply(data, function (x) mean(x$foo)) Which is very suggestive of R functions being first class objects imho.
Kingdeme (3 years ago)
Is there anyone who knows how ot embed R in c++?
Shokodemon (3 years ago)
+Kingdeme r offers its mathematics library - fire up synaptic and check it out if you're on ubuntu. just look for r-mathlib
Guy Dawson (3 years ago)
+Kingdeme you might find this useful https://github.com/eddelbuettel/rinside
Ahmad Abd-Elghany (3 years ago)
The R Language The Good The Bad And The Ugly
nasser alshammari (3 years ago)
Oh! I haven't heard of Haxe before. looks interesting.
Ahmad Abd-Elghany (3 years ago)
off topic: I've just discovered Haxe en.wikipedia.org/wiki/Haxe?oldformat=true :D, you write in Hax and you get JavaScript , Java, C#, C++, Python, PHP, CGI, and Node.js for that specific topic: I guess am not willing to do R in any time soon, so py to R sounds great for me
nasser alshammari (3 years ago)
Great talk from John Cook! I'm invested in Python+Matplotlib+SciPy+Pandas for my scientific work :) I know of a library called "rpy2" that allows you to write R in Python. But I havn't tried it before.
Gerald Creedon (3 years ago)
Excellent introduction / description of the advantages and pitfalls of using R.

Would you like to comment?

Join YouTube for a free account, or sign in if you are already a member.