Documentation on Performance considerations #16310

Dr-Irv · 2017-05-09T20:20:07Z

Problem description

We could use a section in the docs about performance considerations. @TomAugspurger has a nice notebook with suggestions here: http://nbviewer.jupyter.org/github/TomAugspurger/pandas-head-to-tail/blob/master/06-Performance.ipynb

An item to add relates to using .assign(), where, if you have a big DataFrame, it is more inefficient in time and memory to use .assign() as opposed to just creating each new column without the use of method chaining via the paradigm of

    df['newcol1'] = df.a + df.b
    df['newcol2'] = df.c + df.d

The text was updated successfully, but these errors were encountered:

jreback · 2017-05-10T11:04:50Z

see also #3871 and #8178

andymaheshw · 2017-05-22T16:44:14Z

Working on it at pycon2017 :)

jorisvandenbossche · 2017-05-23T09:26:56Z

There was also a presentation on PyCon about optimizing pandas: https://github.com/sversh/pycon2017-optimizing-pandas, which can probably be used as well for some inspiration

mroeschke · 2024-01-27T23:04:38Z

Looks like we have https://pandas.pydata.org/pandas-docs/stable/user_guide/enhancingperf.html so closing for now

jreback added Docs Performance Memory or execution speed performance labels May 10, 2017

jreback mentioned this issue May 10, 2017

focus/start enhancing performance section within pandas/numpy #8178

Closed

jreback added this to the Next Major Release milestone May 10, 2017

jreback added Difficulty Novice labels May 10, 2017

andymaheshw mentioned this issue May 22, 2017

WIP - Additional documentation for Performance Improvements using Pandas #16439

Closed

TomAugspurger added the good first issue label Oct 11, 2017

jreback removed the Difficulty Novice label Dec 15, 2017

jbrockmendel removed the Effort Medium label Oct 21, 2019

mroeschke removed this from the Contributions Welcome milestone Oct 13, 2022

mroeschke closed this as completed Jan 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Documentation on Performance considerations #16310

Documentation on Performance considerations #16310

Dr-Irv commented May 9, 2017

jreback commented May 10, 2017

Uh oh!

andymaheshw commented May 22, 2017

Uh oh!

jorisvandenbossche commented May 23, 2017

Uh oh!

mroeschke commented Jan 27, 2024

Uh oh!

Uh oh!

Documentation on Performance considerations #16310

Documentation on Performance considerations #16310

Comments

Dr-Irv commented May 9, 2017

Problem description

jreback commented May 10, 2017

Uh oh!

andymaheshw commented May 22, 2017

Uh oh!

jorisvandenbossche commented May 23, 2017

Uh oh!

mroeschke commented Jan 27, 2024

Uh oh!