Here is the latest bag of tweets*, which covers **July 2013**.
(*) These are interesting news items that I found on Twitter and that I archive periodically.
* newsycombinator: Bootstrap 3 RC1 (27 Jul)
* JeromyAnglim: I'm very excited about the new Journal of Open Psychological Data @JelteWicherts (27 Jul)
* JeromyAnglim: Great set of articles on data sharing in psychology by @JelteWicherts (27 Jul)
* neilfws: Easy pictograms using R (27 Jul)
* dataJujitsu: We now have a holding page for the @medcodes repository: Full database out soon. #opendata in #publichealth research (25 Jul)
* webbedfeet: Zipf's Law with #pydata packages #datadc (25 Jul)
* JohnDCook: You're working on a big data set if it's faster to mail a hard drive containing the data than to download it. (25 Jul)
* UnixToolTip: Awk FAQ (25 Jul)
* drago_carlo: Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts (25 Jul)
* copheehaus: Hello World! Tweeting new posts from Coffeehouse blog aggregator - Collection of data blog posts - (25 Jul)
* RevoAndrie: RT @RevolutionR: Do ppl ask you, "What is #R?" If they do, we made a little video that we think may help. #rstats (25 Jul)
* StefanKarpinski: The first nbviewer produced with our brand new pure Julia IPython kernel: . #IPython #JuliaLang (25 Jul)
* newsycombinator: More Git and GitHub Secrets (25 Jul)
* gappy3000: There are at least four alternative fast experimental implementations of #rstats. (25 Jul)
* ewancarr: Useful for presentations: a list of sites for finding freely available images online (via @LSEImpactBlog) #phdchat (24 Jul)
* carlislerainey: In case you missed it, I posted an alpha version of compactr, a package that I use to make graphics in R. #rstats (24 Jul)
* genetics_blog: Practical #Typography (24 Jul)
* moorejh: #infovis #dataviz RT @franknorman @scholarlykitchn: Dynamic #Visualization of Biomedical Journals 2003-2012 (24 Jul)
* AndrewRAConway: Nice latent variable models ~ What is working memory capacity, and how can we measure it? | ReadCube Articles (24 Jul)
* RDataMining: Lecture slides of a MapReduce course, which is a part of 2008 Independent Activities Period at MIT (24 Jul)
* jedisct1: RT @bketelsen: Have you guys seen tmate? Brilliant screen sharing. (24 Jul)
* fperez_org: First light for a native Julia kernel implementing the #IPython protocol today. S Johnson, J Bezanson, S Karpinski (24 Jul)
* drago_carlo: Gaussian kernels via @prismatic #MachineLearning (24 Jul)
* zmalltalker: Famous Emacs Users (that are not famous for using Emacs) | 肉山博客 (Wenshan's Blog) - (24 Jul)
* randyzwitch: A Beginner's Look at Julia #julialang (24 Jul)
* cemerick: Commodore 64, 128, and GEOS fonts! (24 Jul)
* RDataMining: Data-Intensive Text Processing with MapReduce, a book of 175 pages. Its online PDF version is provided at . (24 Jul)
* moorejh: #genetics #genome #health MT @DaleYuzuki @genomemedicine: What has #genomics ever done for medicine? (24 Jul)
* JustGlowing: Combining Scikit-Learn and NLTK to use SVM classification (24 Jul)
* vsbuffalo: Ran across "In All Likelihood", instantly bought it. Like Casella & Berger of 21st C. I have a book buying problem. (24 Jul)
* JeromyAnglim: Interesting question: When is it defendable to use manifest instead of latent variables in psychological research? (23 Jul)
* epistatistic: An interesting example of a time-stratified case-crossover analysis. I highly recommend it! (23 Jul)
* SmartTypes: Think Bayes (free e-book): Great mix of Python & Bayesian statistics via @suneman : @SmartTypes (22 Jul)
* viatropos: pretty cool idea, a repository of ideas - (22 Jul)
* thomaslevine: is sort of the best website ever. (22 Jul)
* writelatex: F1000Research Journal Article Template (to go with today's announcement of free submissions) (22 Jul)
* joscani: Analyzing Your Data on the AWS Cloud (with R) via @rbloggers (22 Jul)
* worldbankdata: Accessing the @worldbank #opendata APIs in Python, R, Ruby and Stata: (22 Jul)
* csarven: Stats Analysis #LinkedData #Statistics #Analysis #UI (22 Jul)
* DrDanielSwan: Paper: Estimation of allele frequency and association mapping using next-generation sequencing data (22 Jul)
* moorejh: Great collection of #visualization vignettes from Lawrence Berkeley Natl Lab #infoviz #infovis #dataviz (22 Jul)
* moorejh: Inference for additive interaction under exposure misclassification #epistasis #complexity #datascience (22 Jul)
* LaTeXTemplates: New #LaTeX template for a FAQ document: Try it out now on @sharelatex and @writelatex! (22 Jul)
* stefanvdwalt: Sparse kernels FTW -- our diffusion paper from ISBI2013 Accompanying notebook here: (21 Jul)
* jtoy: Random forests in python (21 Jul)
* newsycombinator: Bash Configurations Demystified (21 Jul)
* rockymadden: Clojure to native code via Scheme: (21 Jul)
* jasonbaldridge: Also: Juola is one of the foremost researchers working on computational stylistics. Here's his book. (21 Jul)
* _DavidMcCormack: @NewPsychologist excellent site explaining human behaviour and how the brain works from McGill University, (21 Jul)
* LewisSpurgin: My Dad asked me how much Evolution paid me for my paper. Had to explain that I paid them, and how much they charge to download a pdf :( (21 Jul)
* bffo: Brad Chapman uses Gemini for integrating data in sql #ISMBECCB #BOSC2013 #translational #bioinformatics (20 Jul)
* recology_: Datawrapper - An open source tool to create embeddable #d3 #highcharts charts in minutes via @mfenner (20 Jul)
* SciPyTip: New advances in conda (20 Jul)
* treycausey: All of the slides and videos from Hadoop Summit are up. (20 Jul)
* ErgoEmacs: Emacs Form Feed (^L) Display Suggestion and Tips (20 Jul)
* newsycombinator: Netbeans 7.4 beta with awesome set of features (20 Jul)
* mfenner: New blog post: Creating charts with Datawrapper (20 Jul)
* Bill_Gardner: NYTimes: Let’s Shake Up the Social Sciences (20 Jul)
* planetpython: Mike Driscoll: Reportlab – All About Fonts (20 Jul)
* pushthings4ward: [PDF] Data Harvesting and Cleaning | #ddj #scraping #research (20 Jul)
* mgrafit: Learning #d3js by the book - (20 Jul)
* paulblaser: On learning bioinformatics - (19 Jul)
* kragen: A small Lisp program from 1960, and how to get it to run in Common Lisp. Thanks to Rainer Joswig for finding this! (19 Jul)
* ibdknox: . @swannodette that is wonderfully magical. #ClojureScript (19 Jul)
* thoughtbot: Tmux copy & paste on OS X just became a whole lot easier: (19 Jul)
* OlivierVerdier: tips for modern #LaTeX typesetting // cc @TeXtip (19 Jul)
* rickasaurus: Javascript isn't Scheme I love this post (19 Jul)
* webbedfeet: RT @ContinuumIO: Check out our new blog: #PyData #Wakari #datadc (19 Jul)
* webbedfeet: "R spells for data wizards" #datadc #rstats (19 Jul)
* treycausey: Visualizing classifier results with #ggplot2 (19 Jul)
* CompSciFact: Books for CS grad students (19 Jul)
* jjgibaja: R documentation search - All R packages and R manuals from CRAN and Bioconductor (via @vginer_upv) (19 Jul)
* jjgibaja: The summer of R (via @kdnuggets) #rstats (19 Jul)
* abmathewks: Plotting Likert-Scales (net stacked distributions) with ggplot #rstats via @strengejacke #Awesome (19 Jul)
* moorejh: New book on #MachineLearning with #Python #bioinformatics #datamining #datascience HT @marlecce (19 Jul)
* MongoDB: How to speed up MongoDB Map Reduce (19 Jul)
* neilfws: "GitHub...a social network where programmers get together and get work done without bosses, e-mails or meetings": (19 Jul)
* ramnath_vaidya: Here is the link to the timeline #rstats #Cricket (19 Jul)
* xieyihui: @cpsievert @rstudioapp yes, and the source is here (19 Jul)
* hmCuesta: Generating random numbers and plots in Python (18 Jul)
* timelyportfolio: #rstats #rCharts dev branch now has #jquery datatables .@ramanth_vaidya (18 Jul)
* siah: How R will turn into SQL (18 Jul)
* cyberlabe: Data visualisation app expected to launch this month (18 Jul)
* iragreenberg: I'll be periodically adding to this collection of workshop examples: (18 Jul)
* moorejh: #datamining MT @sciencebase #bioinformatics method for predicting whether a chemical will be toxic (18 Jul)
* CjBayesian: Game theory "Hunger Games" tournament with low barrier to entry. Hack on. #python (18 Jul)
* statslabdublin: programming with big data in R : high-level distributed data parallelism in R (18 Jul)
* statslabdublin: Large-Scale Predictive Modeling with R and Apache Hive: from Modeling to Production (from UseR!2013) (cc @Conker_io ) (18 Jul)
* ConcejeroPedro: #useR2013 You must Read this about R naming conventions part of this (18 Jul)
* xieyihui: thanks to the smart audience @Bioconductor 2013, I finally figured out the right way to process lang engines in knitr (18 Jul)
* drago_carlo: Python for Data Analysis: The Landscape of Tutorials via @prismatic #Statistics #DataAnalysis #Python (18 Jul)
* technomancy: Blogged about comparative lispology: The same simple game across Scheme, Racket, Clojure, and Emacs Lisp. (18 Jul)
* planetclojure: An introduction to deep code-walking macros with Clojure (17 Jul)
* Atabey_Kaygun: New post: A Gradient Descent Implementation in Lisp (17 Jul)
* joscani: likert package. cool! (17 Jul)
* zentree: 'Statistical education in the 21st century: a review of challenges, teaching innovations and strategies for reform' (17 Jul)
* JeromyAnglim: New Post: Evaluating the Potential Incorporation of #rstats into Research Methods Education in Psychology (17 Jul)
* MarcusMunafo: Yet more evidence that the average power of studies in many fields of biomedical research is low (~20% again): (17 Jul)
* medriscoll: The vanguard of data visualization lies in scientific figures: a powerful, complex view of cancer via @NatureMagazine (17 Jul)
* GaelVaroquaux: Replicability is not Reproducibility: Nor is it Good Science (17 Jul)
* zentree: Dahtah: Simon Barthelmé's blog on #stats, #data, etc. HT: @berndweiss (17 Jul)
* wesmckinn: @gvanrossum sorry, you have to click around the individual benchmarks. e.g. (17 Jul)
* zentree: 'The State of namingConventions in #Rstats'. Hear, hear. (17 Jul)
* nathanmarz: Excellent post from @cemerick. Having control over 100% of your time is indeed awesome (17 Jul)
* _inundata: R docs are now available in Dash! Thanks @kapeli #rstats (17 Jul)
* heroku: How Heroku Works by @JonMountjoy (16 Jul)
* ibdknox: A preview of #LightTable 0.5 (16 Jul)
* climagic: zbarcam --prescale=640x480 |while read c;do [ "${c%%:*}" = "EAN-13" ] && xdg-open http://en.openfoodfacts\.org/product/"$c";done # Barcodes (16 Jul)
* FGRibreau: Seafile - A file syncing and collaboration platform for teams #dropboxlike // Self-hosted version available (16 Jul)
* ogrisel: All scikit-learn tests pass under Python 3.3 in master. Thanks @justinvf and @larsmans_nl! (16 Jul)
* swcarpentry: Where should we submit for publication? (Must be a real open access journal.) (16 Jul)
* msgbi: RT @newsycombinator: Cello: Higher level programming in C (16 Jul)
* berndweiss: "The Big Picture: A Workflow for Reproducible Research" (by @ChrisGandrud ) (16 Jul)
* juampatronics: Benchmarking different implementations of random forest #machinelearning #randomforest #Python #wiserf (16 Jul)
* MBstream: Auto-WEKA via @prismatic (16 Jul)
* zentree: 'A Multi-Paradigm Modeling Definition and Language' by @b__k (PDF). (16 Jul)
* HlthAnalysis: Recommended Readings for Data Visualizations (16 Jul)
* hmCuesta: Explore Python, machine learning, and the NLTK library (16 Jul)
* planetclojure: Purely Functional Data Structures in Clojure: Red-Black Trees (15 Jul)
* a_matsuda: "a Ruby program that generates Scala program that generates Scheme program that generates ...(through 50 languages)" (15 Jul)
* mikaelhuss: R spells for data wizards (15 Jul)
* siah: Unix: Chopping up CSV files (15 Jul)
* M_T_Patterson: Definitely check out @CjBayesian 's intro to data science competitions (via @Rbloggers ) #RStats, #DataScience (15 Jul)
* BiologistJohn: User Guide for Python for Population Genomics #python #genomics #biology (15 Jul)
* John4man: Useful unix commands for data science. I love doing as much as I can in Bash. xargs is the shiz. (15 Jul)
* ArtProgramming: "Lisp is the result of taking syntax away, Perl is the result of taking syntax all the way" – Doug Hoyte (15 Jul)
* fonnesbeck: The deal is done: I now have 14 months to write “Bayesian Statistical Computing using Python”. (In my copious amounts of spare time) (15 Jul)
* inductivestep: Possibly interesting collection of questionnaires, including the ECR-R (15 Jul)
* fperez_org: xkcd() now officially supported matplotlib API; @jakevdp showcases it in full glory: . #NeverADullPlotAgain (15 Jul)
* wesmckinn: New JSON support in pandas 0.12 is going to be a game-changer: #pydata (15 Jul)
* Atabey_Kaygun: Introduction to Restricted Boltzmann Machines (14 Jul)
* Atabey_Kaygun: Restricted Boltzmann Machines in R (14 Jul)
* Rbloggers: R talks to Weka about Data Mining: (This article was first published on fibosworld » R, and kindl... #rstats (14 Jul)
* StatFact: Do clinical trials work? (14 Jul)
* ronert_obst: New post: "List Comprehensions in R" (14 Jul)
* EpiFunky: Stephen Senn about the ‘indefinitely many predictors’ criticism in randomization via @StatsBlogs (14 Jul)
* eddelbuettel: Slides from my Rcpp talk to the Sydney R Users Group this week: #rstats #rcpp (13 Jul)
* swannodette: I haven't been this excited about a blog post in a very long time #complexity #js #ui #async (13 Jul)
* ctitusbrown: How can we do literate programming for reproducibility, in Python? guest post on my blog from @yardenkatz (13 Jul)
* drago_carlo: Book: Applied Data Science @analyticbridge #DataScience #Python #Statistics #MachineLearning #DataMining #Book (13 Jul)
* ChrisBeeley: Open source software for healthcare. Start with free, finish up awesome (13 Jul)
* alandipert: Faster Clojure Startup with Class Data Sharing (13 Jul)
* trished: Reporting Guidelines in the biomedical literature #springerlink Useful review, & not paywalled (13 Jul)
* gawbul: Great free book on Linux command line :-) (13 Jul)
* karpathy: Direct link to Colin Lea's CVPR 2013 papers page (pretty LDA, tfidf rankings) Now want this for all CV/ML papers... (13 Jul)
* freakonometrics: [free ebook] "Speaking Serial R with a Parallel Accent" (Programming with Big Data in R) (13 Jul)
* RickWicklin: Great analysis of nerf guns. #rstats via @scheidegger See this simpler experiment by my kids (13 Jul)
* genetics_blog: Next generation sequencing of viral RNA genomes #genomics (13 Jul)
* freakonometrics: nice! "The beauty of scientific papers - Design trends of the past 350 years" (discovered via @goulu) (13 Jul)
* mbostock: New release notes for everything going back to 3.0. Plus codenames! #d3js (13 Jul)
* hylopsar: Ben Bolker's notes on workflows, pipelines, reproducible research, etc. via @wordpressdotcom (13 Jul)
* ramnath_vaidya: Integrated Learning Environment: A Prototype #rstats @kaythaney @gvwilson #rstats @hadleywickham @popcornjs (13 Jul)
* johnros2013: An Introduction to Social Media for Scientists (13 Jul)
* bradfordcross: Solid look at what we're trying to solve (13 Jul)
* ConcejeroPedro: #useR2013 gReat qgraph objects can be exported to igraph and make fantastic chord graphs of item-factor relationships (12 Jul)
* Bioconductor: #Bioconductor / #rstats new course material from #CSAMA 2013, #useR2013 (12 Jul)
* tslumley: @b__k @zentree cf Rob. Gentleman "data is large and confidential; code is small and open-source: which one do you send over the internet?" (12 Jul)
* cemerick: @rkneufeld Re: jruby+sass+clojure, this looks promising (11 Jul)
* berndweiss: RT @zoonpolitikon: Another lost soul. Sigh. "Dear LaTeX: It’s not you, it’s me" (11 Jul)
* adolfoalvarez: @adolfoalvarez: Rstudio announces at #useR2013 a new preview version. (11 Jul)
* Rbloggers: rxDTree(): a new type of tree algorithm for big data: (This article was first published on Revolu... #rstats (11 Jul)
* statslabdublin: : looks quite interesting useful teaching tool #user2013 lightning talks (11 Jul)
* HarlanH: A Julia Meta Tutorial - Data Community DC #JuliaLang (11 Jul)
* betheross: Just setup my first private @github repository. Excited! Students and postdocs can get a free account HT @ethanwhite (11 Jul)
* GuyHorev: Manuscript Checklist Statistical Problems to Document and to Avoid (11 Jul)
* revodavid: Slides from my #User2013 presentation, "Real-time, Big-Data R? Yes you can!": #rstats (11 Jul)
* ConcejeroPedro: #useR2013 interactive tableplot of bigdata with tabplotd3 (11 Jul)
* ConcejeroPedro: @gmiritello #useR2013 tabplot - demo script online very good! (11 Jul)
* ConcejeroPedro: #useR2013 exploring big data with tabplot - slides (11 Jul)
* jedisct1: RT @franckverrot: hstore vs. JSON - Which to use in Postgres (11 Jul)
* robjhyndman: Asking good questions (10 Jul)
* Biff_Bruise: MT @PeterSpeyer: New #dataviz w/ data on obesity, phys. activity, life expectancy in US counties. (also time series) (10 Jul)
* hemant_Pt: Awesome keynote on visualization going on #ICWSM - check out (10 Jul)
* ramnath_vaidya: rCharts + Polychart + AngularJS! #rstats #angularjs #d3js @Polychart @d3visualization (10 Jul)
* DrBunsen: New blog post—Matryoshka Hardware: (10 Jul)
* jedisct1: Julia for R programmers: (10 Jul)
* TWiecki: Plotly: Plotting in the cloud (9 Jul)
* cemerick: nrepl-python-client 0.0.2 released, and now on PyPI thanks to @MaxCountryman #clojure #nrepl (9 Jul)
* ptwobrussell: Lots of great updates to Mining the Social Web's GitHub repo including a newly revised Twitter Cookbook (9 Jul)
* mgrafit: RT @JanWillemTulp This is a great overview of web-based color tools (9 Jul)
* moorejh: Lessons from functional analysis of #GWAS - #Cancer Research #genomics #genetics (9 Jul)
* technomancy: I wrote a little game tonight; once in Chicken Scheme, and once in Clojure: (9 Jul)
* Tojiro: This is pretty awesome: OpenGL tutorials with inline WebGL demonstrations! (9 Jul)
* gvanrossum: mypy: -- a Python variant that combines duck and static typing. Now with compatible syntax! (9 Jul)
* reichlab: really cool. RT @benjaminlind: R on the Cloud is now at #rstats version 3.0.1, has a graphics window ... (9 Jul)
* teddy777: Our next data science "salon" will be focused on modern machine learning methods as well as a bit more formal under… (9 Jul)
* jsundram: I compared @Nodebox to @ProcessingOrg and @processingjs and found some interesting performance differences: (9 Jul)
* arnaudsj: Peter Norvig - Learning Programming: (9 Jul)
* walkingrandomly: Gccpy is an ongoing project to allow users to compile #Python Code as an AOT (ahead of time) compiled language. (8 Jul)
* drago_carlo: Interactive Heatmaps (and Dendrograms) @prismatic #DataMining #MachineLearning #Statistics #Visualization #Rstats (8 Jul)
* drago_carlo: Gephi + MALLET + EMDA via @prismatic #DataMining #TextMining #Networks #NaturalLanguageProcessing (8 Jul)
* albertocairo: To save MT @lisawilliams "@inkouper: A Practical Introduction to Data Science Skills: Notes and Links " #wjchat (8 Jul)
* TWiecki: .@treycausey "scikit-learn really needs to start speaking the language of dataframes." -- here's a start: (8 Jul)
* jattenberg: Predictive Discrete Latent Factor Models for Large Scale Dyadic Data- (8 Jul)
* yokofakun: [delicious] The NGS WikiBook: a dynamic collaborative online training effort with long-term sustainability #tw... (8 Jul)
* Atabey_Kaygun: A stack-based fast data language by @zedshaw (7 Jul)
* analyticbridge: Veterinary Epidemiologic Research: Modelling Survival Data – Parametric and Frailty Models: (This article w... (7 Jul)
* planetclojure: Exploratory Genomics with Clojure (7 Jul)
* freakonometrics: nice ! "OLS residuals from a regression" ) on @DEAGiles's blog, see (7 Jul)
* arnicas: Interesting! MT @DashingD3js: Using #D3, #backbone and #tornado to visualize histograms of a csv file | D Rodriguez (7 Jul)
* 27point7: Anonymizing statistics is no laughing matter... (6 Jul)
* Rbloggers: A Recipe for Trellis/Faceted Plots Using Base R Graphics: (This article was first published on Bi... #rstats (6 Jul)
* BrianNosek: If you use GMAIL, this is amazing! Immersion: a people-centric view of your email life (6 Jul)
* UnixToolTip: Emacs Org Mode Version 8: Upgrading and Some Tips (6 Jul)
* alexk_z: Matrix multiplication in R with gputools on a 3 year old NVIDIA GT 330M, 20x faster than CPU atlas libs #impressive (6 Jul)
* ClojureWerkz: 15 tutorials on ClojureScript by @magomimmo . Worth a retweet, don’t you think! (6 Jul)
* paulblaser: Logistic Regression in Python via @yhathq (6 Jul)
* paulblaser: Seven handy SQL features for data scientists via @yhathq (6 Jul)
* yannabraham: Visualization of Multidimensional Data by Bob Jensen (6 Jul)
* statslabdublin: List of Courses at Big Data University Most (all?) of them free too (6 Jul)
* jcukier: Wow, this one just turned out like an abstract visualization (5 Jul)
* rOpenSci: [Blog] From occurrence data to interactive maps on the web #rstats @USGS #geojson (5 Jul)
* kdnuggets: ebook: Instant Weka How-to (5 Jul)
* Atabey_Kaygun: Springer changes its self-archiving policy (5 Jul)
* fonnesbeck: @seanjtaylor here’s one. Check out the Weibull model, line 26 (4 Jul)
* Atabey_Kaygun: Lisp in Parallel (4 Jul)
* drago_carlo: Mixture Discriminant Analysis @Prismatic @Rbloggers #Statistics #DataMining #MachineLearning #Rstats (3 Jul)
* nikete: An Equivalence between the Lasso and Support Vector Machines (3 Jul)
* brikis98: Github now supports "releases", with changelogs, binary assets, and project history. (3 Jul)
* hadleywickham: latest edition of the R journal now available at - 20 articles about #rstats! (3 Jul)
* genetics_blog: New blog post: Customize your .Rprofile and Keep Your Workspace Clean #Rstats (3 Jul)
* JanWillemTulp: Awesome! RT @gnrtvdsgn: We are on github! All code examples of the book in p2.0 or p1.5.1 flavor! #processing (2 Jul)
* Atabey_Kaygun: New post: An implementation of the k-nearest neighbor clustering algorithm in #lisp (2 Jul)
* melb4886: Fascinating review: Social environmental effects on gene regulation #springerlink (2 Jul)
* jjgibaja: R On Cloud (1 Jul)
* mgrafit: Closing the gap btw #rstats and #d3js ! d3Network by @ChrisGandrud + examples (1 Jul)
* drago_carlo: Visualising True Positives and False Positives against Features with scikit-learn @prismatic #MachineLearning #Python (1 Jul)
* msgbi: "@newsycombinator: Algorithms " (1 Jul)