Sidebar

R Programming

r_programming
R Programming ericjmorey 9 months ago 100%
Python Rgonomics | Emily Riederer https://emilyriederer.netlify.app/post/py-rgo/

cross-posted from: https://programming.dev/post/8257343 > [Emily Riederer Writes](https://emilyriederer.netlify.app/): > > > Switching languages is about switching mindsets - not just syntax. New developments in python data science toolings, like polars and seaborn’s object interface, can capture the ‘feel’ that converts from R/tidyverse love while opening the door to truly pythonic workflows > > > Just to be clear: > > > > - This is not a post about why python is better than R so R users should switch all their work to python > > - This is not a post about why R is better than python so R semantics and conventions should be forced into python > > - This is not a post about why python users are better than R users so R users need coddling > > - This is not a post about why R users are better than python users and have superior tastes for their toolkit > > - This is not a post about why these python tools are the only good tools and others are bad tools > > > # The Stack > > > > WIth that preamble out of the way, below are a few recommendations for the most ergonomic tools for getting set up, conducting core data analysis, and communication results. > > > > To preview these recommendations: > > > > ### Set Up > > > > - Installation: [pyenv](https://github.com/pyenv/pyenv) > > - IDE: [VS Code](https://code.visualstudio.com/docs/languages/python) > > > > ### Analysis > > > > - Wrangling: [polars](https://pola.rs/) > > - Visualization: [seaborn](https://seaborn.pydata.org/) > > > > ### Communication > > > > - Tables: [Great Tables](https://posit-dev.github.io/great-tables/articles/intro.html) > > - Notebooks: [Quarto](https://quarto.org/) > > > > ### Miscellaneous > > > > - Environment Management: [pdm](https://pdm-project.org/latest/) > > - Code Quality: [ruff](https://docs.astral.sh/ruff/) > > Read [Python Rgonomics](https://emilyriederer.netlify.app/post/py-rgo/)

4
0
r_programming
R Programming ericjmorey 9 months ago 92%
Conda is moving to Mastodon & LinkedIn | conda.org/blog conda.org

cross-posted from: https://discuss.online/post/4110869 > [Conda (@conda@fosstodon.org) writes](https://fosstodon.org/@conda/111654850158557156): > > > Conda is moving our social media presence from Twitter/X to Mastodon and LinkedIn at the start of 2024. It's past time to move into spaces that are welcoming and more in line with our community values. Going forward, you can find us at > 🐘 @conda@fosstodon.org (https://fosstodon.org/@conda) > 🔗 [Conda Community on LinkedIn](https://linkedin.com/company/condacommunity) > > Read [Conda is moving to Mastodon & LinkedIn | conda.org/blog](https://conda.org/blog/2023-12-27-social-move/) > > # Conda (Software) > > Conda provides package, dependency, and environment management for any language. > > Using conda provides a streamlined approach to package management, platform compatibility, environment isolation, and access to an extensive package ecosystem. It is particularly beneficial for data scientists, researchers, and developers working with diverse software requirements across different projects. > > # Conda Community > > The "conda" community is made up of millions of users, packaging maintainers and tool developers. Conda is not a single organization but rather a concerted effort of many different organizations, all devoted to the mission of providing easy access to various types of free software regardless of the operating system or programming language. > > We firmly believe that everyone belongs in open-source, and we want to start by thanking you for taking the time to read this page. What follows is a high level summary of all the projects and organizations which make up the conda community with links provided where you can learn more or get involved yourself. > The many meanings of "conda" > > Traditionally associated with the Anaconda distribution, nowadays the term "conda" refers to more than just a package manager or a software repository. Its many definitions also encompass community packaging efforts like [conda-forge](https://conda-forge.org/) and [bioconda](https://bioconda.github.io/), as well as new tools developed in the [Mamba](https://github.com/mamba-org) and [conda-incubator](https://github.com/conda-incubator/) organizations. All these efforts show that the conda ecosystem is no longer defined by a single actor and continues to grow and thrive. > > Organizations on GitHub include: > > - [@conda](https://github.com/conda), plus Anaconda, Inc. efforts like [@AnacondaRecipes](https://github.com/AnacondaRecipes/), [@anaconda-distribution](https://github.com/anaconda-distribution), [@ContinuumIO](https://github.com/ContinuumIO/) > - [@conda-forge](https://github.com/conda-forge), [@regro](https://github.com/regro/) > - [@conda-incubator](https://github.com/conda-incubator/) & [@conda-tools](https://github.com/conda-tools/) > - [@mamba-org](https://github.com/mamba-org) > - [@bioconda](https://bioconda.github.io/) > > Some tools you might be familiar with are [conda](https://github.com/conda/conda) or [conda-build](https://github.com/conda/conda-build) themselves but also community efforts like [mamba](https://github.com/mamba-org/mamba), [boa](https://github.com/mamba-org/boa), [setup-miniconda](https://github.com/conda-incubator/setup-miniconda), [conda-lock](https://github.com/conda-incubator/conda-tree) or [conda-tree](https://github.com/conda-incubator/conda-tree), among many more. > > Read more about [the conda community](https://conda.org/community).

11
2
r_programming
R Programming ericjmorey 9 months ago 100%
Python equivalent R code, some nuance concerning RStudio discindo.org

2023-12-29 by [Novica Nakov](https://discindo.org/authors/novica/): > I notice that `Ctrl+Enter` for running the code in `Python` and in `R` is not the same thing. Read [the whole article](https://discindo.org/post/100-days-of-python-and-r/)

2
1
r_programming
R Programming produnis 9 months ago 50%
Free R-Manual in German language www.produnis.de

If you are looking for a free R Manual in german language, check out my Book. The PDF-Version has about 450pages.

0
0
r_programming
R Programming ericjmorey 10 months ago 100%
[The Art of] Regression and other stories https://xianblog.wordpress.com/2020/07/23/the-art-of-regression-and-other-stories/

cross-posted from: https://programming.dev/post/7032449 > This will likely be the last (for some time) of my posts about learning resources for Statistical methods and underlying theories for Data Science and Machine Learning foundations. > > [Regression and Other Stories](https://avehtari.github.io/ROS-Examples/) by [Andrew Gelman](http://www.stat.columbia.edu/~gelman/), [Jennifer Hill](https://steinhardt.nyu.edu/people/jennifer-hill), and [Aki Vehtari](https://users.aalto.fi/~ave/), including the [( R ) code and data for the examples](https://avehtari.github.io/ROS-Examples/examples.html). > > The author of the linked review seems generally positive about the text, though they noted some concerns. > > I'm least likely to use this as my primary resource going forward, in part due due [an enthusiastic recommendation](https://feddit.uk/comment/5065500) for [Statistical Rethinking](https://xcelab.net/rm/statistical-rethinking/). But it looks like promising supplemental resource to bridge that gap between theory and application.

5
0
r_programming
R Programming ericjmorey 10 months ago 66%
Statistical Rethinking - A Bayesian Course with Examples in R and Stan (and PyMC3, brms, and Julia too) github.com

cross-posted from: https://programming.dev/post/6990204 > Richard McElreath has made his [course materials](https://github.com/rmcelreath/stat_rethinking_2024) available on GitHub. > > However, the course follows the 2nd edition of McElreath's book [Statistical Rethinking](https://xcelab.net/rm/statistical-rethinking/) which is not available in a free digital format. > > After watching the first lecture in the [Statistical Rethinking 2023 YouTube Playlist](https://www.youtube.com/watch?v=FdnMWdICdRs&list=PLDcUM9US4XdPz-KxHM4XHt7uUVGWWVSus), I might go ahead and purchase the text and use this course instead of Trevor Hastie and Rob Tibshirani's [An Introduction to Statistical Learning (with Applications in R or Python) course](https://programming.dev/post/6942085). > > I also like that this resource has made an explicit attempt to provide code examples in Julia as well as the more popular Python and R. > > I wasn't sure who [Richard McElreath](https://www.eva.mpg.de/ecology/staff/richard-mcelreath/) was so I did a quick search which revealed his position as Director of the Department of Human Behavior, Ecology and Culture at the Max Planck Institute for Evolutionary Anthropology in Leipzig. >

1
0
r_programming
R Programming ericjmorey 10 months ago 88%
An Introduction to Statistical Learning (with Applications in R or Python) https://www.statlearning.com/

cross-posted from: https://programming.dev/post/6942085 > ### Book (Free) > > The resource on statistical methods recommended to me the most has been **[An Introduction to Statistical Learning (with Applications in R or Python)](https://www.statlearning.com/) by Gareth James, Daniela Witten, Trevor Hastie, Rob Tibshirani, and Jonathan Taylor**. Its free to download and has been kept up to date. (The latest edition is from 2022.) > > ### Online Course (Free with optional payment for "Verified Track") > > For those that prefer a structured online course **[StanfordOnline: Statistical Learning with R](https://www.edx.org/learn/statistics/stanford-university-statistical-learning) by Trevor Hastie and Robert Tibshirani** uses An Introduction to Statistical Learning (with Applications in R) as the course textbook. > > ### More In-Depth Book > > Individuals with advanced training in the mathematical sciences may wish to use **[The Elements of Statistical Learning (Data Mining, Inference, and Prediction)](https://hastie.su.domains/ElemStatLearn/) by Trevor Hastie, Robert Tibshirani, and Jerome Friedman** which provides a more comprehensive and detailed treatment of a wider range topics in statistical learning. >

7
0
r_programming
R Programming MalditoBarbudo 10 months ago 100%
Packages built with Rcpp dependency failing R-devel checks at CRAN github.com

If you, like me, maintain any package in CRAN with Rcpp dependency, be aware checks in R-devel are failing. The fix is simple (is in the bug report linked), but, at least in my case, must be done before 12/12 or the packages will be removed from CRAN.

3
0
r_programming
R Programming pglpm 1 year ago 100%
Does subsetting (matrices or arrays) always perform a partial copy?

Some large datasets are pushing memory and some functions I'm writing to the limit. I wanted to ask some questions about subsetting, of matrices and arrays in particular: 1. Does defining a variable as a subset of another lead to copy? For instance ``` x <- matrix(rnorm(20*30), nrow=20, ncol=30) y <- x[, 1:10] ``` Some exploration with [`object_size`](https://rdrr.io/cran/pryr/man/object_size.html) from `pryr` seems to indicate that a copy is made when `y` is created, but I'd like to be sure. 2. If I enter a subset of a matrix/array as argument to a function, does it get copied before the function is started? For instance in ``` x <- matrix(rnorm(20*30), nrow=20, ncol=30) y <- dnorm(0, mean=x[,1:10], sd=1) ``` I wonder if the data in `x[,1:10]` are copied and then given as input to `dnorm`. I've heard that `data.table` allows one to work with subsets without copies being made (unless necessary), but it seems that one is constrained to two dimensions only – no arrays – that way. Cheers!

10
4
r_programming
R Programming Froggy 1 year ago 100%
Are there any generalized data communiyies on lemmy?

I used to frequent r/analytics and r/datascience which are more broad than instances dedicated to a technology or language.

10
2