BUG: Fix multi-index colname references in read_csv c engine. #42519

jmcomie · 2021-07-13T17:38:56Z

This fixes an issue with the read_csv c engine when the input has more than one header row and arguments to dtype, na_values, or converters reference multi-index column names as tuples.

closes option dtype in pandas.read_csv does not work properly for mulilevel columns #42446
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

I added a whatsnew entry for v1.4.0 since that's what other recent bug fixes have done, but I'm not yet sure of the criteria for that. Is it appropriate to add it for whichever release "owns" master?

This fixes an issue with the read_csv c engine when the input has more than one header row and arguments to dtype, na_values, or converters reference multi-index column names as tuples.

jreback

looks fine, some comments

pandas/_libs/parsers.pyx

pandas/tests/io/parser/dtypes/test_dtypes_basic.py

jreback · 2021-07-13T23:00:59Z

cc @gfyoung if you'd have a look

jmcomie · 2021-07-15T01:46:29Z

Regarding the failed check -- I've run the test tests/scalar/timedelta/test_arithmetic.py and it's passing locally. If it's an issue in my branch I will work to fix it but it seems this might be an environmental error ?

jreback · 2021-07-15T04:20:32Z

thanks @jmcomie

failed check is not related

…-dev#42519)

BUG: Fix multi-index colname references in read_csv c engine.

b07a432

This fixes an issue with the read_csv c engine when the input has more than one header row and arguments to dtype, na_values, or converters reference multi-index column names as tuples.

jreback added Bug IO CSV read_csv, to_csv MultiIndex labels Jul 13, 2021

jreback added this to the 1.4 milestone Jul 13, 2021

jreback requested changes Jul 13, 2021

View reviewed changes

pandas/_libs/parsers.pyx Outdated Show resolved Hide resolved

pandas/tests/io/parser/dtypes/test_dtypes_basic.py Show resolved Hide resolved

Updates for CR: comment, if block consolidation.

c38b75e

gfyoung approved these changes Jul 14, 2021

View reviewed changes

jmcomie requested a review from jreback July 14, 2021 18:52

jreback approved these changes Jul 14, 2021

View reviewed changes

jreback merged commit 2ec9862 into pandas-dev:master Jul 15, 2021

feefladder pushed a commit to feefladder/pandas that referenced this pull request Sep 7, 2021

BUG: Fix multi-index colname references in read_csv c engine. (pandas…

d508a49

…-dev#42519)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: Fix multi-index colname references in read_csv c engine. #42519

BUG: Fix multi-index colname references in read_csv c engine. #42519

Uh oh!

jmcomie commented Jul 13, 2021

Uh oh!

jreback left a comment

Uh oh!

Uh oh!

Uh oh!

jreback commented Jul 13, 2021

Uh oh!

jmcomie commented Jul 15, 2021

Uh oh!

jreback commented Jul 15, 2021

Uh oh!

Uh oh!

Uh oh!

BUG: Fix multi-index colname references in read_csv c engine. #42519

BUG: Fix multi-index colname references in read_csv c engine. #42519

Uh oh!

Conversation

jmcomie commented Jul 13, 2021

Uh oh!

jreback left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jreback commented Jul 13, 2021

Uh oh!

jmcomie commented Jul 15, 2021

Uh oh!

jreback commented Jul 15, 2021

Uh oh!

Uh oh!