Skip to content
This repository was archived by the owner on Dec 22, 2019. It is now read-only.

Handle columns index duplicates #7

Merged
merged 4 commits into from
Sep 29, 2018
Merged

Handle columns index duplicates #7

merged 4 commits into from
Sep 29, 2018

Conversation

lowerthansound
Copy link
Owner

@lowerthansound lowerthansound commented Sep 28, 2018

  • Lint code
  • Review
  • Summarize changes

@lowerthansound lowerthansound force-pushed the duplicate branch 6 times, most recently from ec4363b to 82ab6f3 Compare September 29, 2018 22:13
araraonline added 2 commits September 29, 2018 20:40
TODO: This shall be reversed later, or be made a bit more strict. My
best choice is: ignore when it is empty of dtype object, consider if it
is empty of another dtype.

May interact somewhat with the result float64 of reindex.
Will be better made in a future version.
When there were duplicates on the columns index, sort was allowed and
duplicates were allowed if the indexes had the same values (as found by
idx.tolist()).

Now, considering that pandas doesn't allow to sort the index when there
are duplicate values (DataFrame.reindex fails) and that searching for
the same values is counter-productive and prone to fail, depending on
the different types of indexes, the behavior was modified to this:

- When sort=True and there are duplicates in at least one index, an
  error is raised and append stops.
- Dframes with duplicate indexes are only considered to be joined when
  the indexes share the same identity (that is, they are the same object
  comparable with `idx1 is  idx2`)

Some other improvements to the code have also been made and I believe it
is better in a general mode.
@lowerthansound lowerthansound merged commit 5dfe32e into secondtry Sep 29, 2018
@lowerthansound lowerthansound deleted the duplicate branch September 29, 2018 23:44
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant