Insights: pandas-dev/pandas
Overview
14 Pull requests merged by 10 people
-
TST: nan->NA in non-construction tests
#62021 merged
Aug 4, 2025 -
EHN: return early when the result is None
#62032 merged
Aug 4, 2025 -
Bump pypa/cibuildwheel from 3.1.1 to 3.1.3
#62039 merged
Aug 4, 2025 -
TST: Speed up hypothesis and slow tests
#62028 merged
Aug 3, 2025 -
BUG: Change default of observed in Series.groupby
#62025 merged
Aug 2, 2025 -
BUG: Avoid copying categorical codes if copy=False
#62000 merged
Aug 1, 2025 -
DOC: Improve documentation for DataFrame.__setitem__ and .loc assignment from Series
#61804 merged
Aug 1, 2025 -
DOC: rephrase CoW ChainedAssignmentError message now CoW is always enabled
#61970 merged
Jul 30, 2025 -
TST: Raise on pytest.PytestWarning
#61996 merged
Jul 30, 2025 -
BUG/DEPR: logical operation with bool and string
#61995 merged
Jul 29, 2025 -
BUG: groupby.agg with UDF changing pyarrow dtypes
#59601 merged
Jul 29, 2025 -
MNT: simplify cibuildwheel configuration
#61984 merged
Jul 28, 2025 -
Bump pypa/cibuildwheel from 2.23.3 to 3.1.1
#61981 merged
Jul 28, 2025 -
TST: add test for dtype argument in str.decode
#61872 merged
Jul 28, 2025
22 Pull requests opened by 17 people
-
API: offsets.Day is always calendar-day
#61985 opened
Jul 28, 2025 -
Fix warning for extra fields in read_csv with on_bad_lines callable
#61987 opened
Jul 29, 2025 -
BUG: Fix ExtensionArray binary op protocol
#61990 opened
Jul 29, 2025 -
DOC: documenting pandas.MultIndex.argsort
#62005 opened
Jul 30, 2025 -
DOC: Standardize noncompliant docstrings in pandas/io/html.py (flake8-docstrings) #61944
#62007 opened
Jul 31, 2025 -
BUG FIX: pandas.arrays.IntervalArray.overlaps() incorrectly documents that it accepts IntervalArray.
#62009 opened
Jul 31, 2025 -
BUG: Fix assert_series_equal for categoricals with nulls and check_category_order=False (#62008)
#62011 opened
Jul 31, 2025 -
Fix cbusday calendar Typecheck v2
#62014 opened
Jul 31, 2025 -
DOC: Add SSLCertVerificationError warning message for documentation b…
#62015 opened
Jul 31, 2025 -
DOC: Add example for multi-column joins using `merge`
#62016 opened
Aug 1, 2025 -
BUG: Fix assert_series_equal with check_category_order=False for categoricals with nulls
#62017 opened
Aug 1, 2025 -
REF: make copy keyword in recode_for_categories keyword only
#62019 opened
Aug 1, 2025 -
BUG: Fix is_signed_integer_dtype to handle abstract floating types (GH 62018)
#62020 opened
Aug 1, 2025 -
continue from #61957 which closed with unmerged commit
#62023 opened
Aug 2, 2025 -
BUG: groupby.idxmin/idxmax with all NA values should raise
#62026 opened
Aug 2, 2025 -
DOC: fix mask/where docstring alignment note (#61781)
#62029 opened
Aug 2, 2025 -
BUG: Catch TypeError in _is_dtype_type when converting abstract numpy types (#62018)
#62030 opened
Aug 3, 2025 -
API: timestamp resolution inference: default to microseconds when possible
#62031 opened
Aug 3, 2025 -
BUG: raise a proper exception when str.rsplit is passed a regex and clarify the docs
#62035 opened
Aug 3, 2025 -
API: improve dtype in df.where with EA other
#62038 opened
Aug 3, 2025 -
API: mode.nan_is_na to consistently distinguish NaN-vs-NA
#62040 opened
Aug 4, 2025 -
[pre-commit.ci] pre-commit autoupdate
#62041 opened
Aug 4, 2025
16 Issues closed by 7 people
-
issubdtype(<categorical>, np.bool_) raises error
#9581 closed
Aug 2, 2025 -
API: Expand read_csv dtype for categoricals
#14503 closed
Aug 2, 2025 -
Dask shuffle performance help
#43155 closed
Aug 2, 2025 -
ENH: to_offset "year"/"quarter"/"month"/"day"/...?
#34586 closed
Aug 2, 2025 -
inf comparison doesn't emulate NaN with use_inf_as_null
#16756 closed
Aug 1, 2025 -
DOC: Improve documentation for DataFrame.__setitem__ and .loc assignment from Series
#61662 closed
Aug 1, 2025 -
BUG: Inconsistent `datetime` dtype based on how the dataframe gets initialized
#61993 closed
Jul 31, 2025 -
BUG: the behavior of DataFrameGroupBy.apply(..., include_groups=True) breaks post-mortem debugging
#61627 closed
Jul 31, 2025 -
BUG: Writing UUIDs fail
#61602 closed
Jul 31, 2025 -
DOC:
#55125 closed
Jul 30, 2025 -
BUG: Python Package fails to load for some users, but not others.
#61991 closed
Jul 30, 2025 -
BUG (string dtype): logical operation with bool and string failing
#60234 closed
Jul 29, 2025 -
PERF: `pandas.DataFrame.stack` with `future_stack=True`
#61994 closed
Jul 29, 2025 -
BUG: Groupby aggregate coersion of outputs inconsistency for pyarrow dtypes
#61636 closed
Jul 29, 2025 -
BUG: Groupby-aggregate on a boolean column returns a different datatype with pyarrow than with numpy
#53030 closed
Jul 29, 2025 -
DOC: Improve docstrings in utility functions in pandas/core/common.py (lines 176–210)
#61986 closed
Jul 29, 2025
12 Issues opened by 10 people
-
BUG: rank with object dtype and small values
#62036 opened
Aug 3, 2025 -
BUG: Wrong inferred type in case of a mixture of boolean, float and integers
#62033 opened
Aug 3, 2025 -
BUG: NaN vs NA in mixed reduction
#62024 opened
Aug 2, 2025 -
DEPR: convert_dtypes keywords
#62022 opened
Aug 2, 2025 -
BUG: pd.api.types.is_signed_integer_dtype(np.floating) throws TypeError
#62018 opened
Aug 1, 2025 -
BUG: Sql select from database type cast issue
#62013 opened
Jul 31, 2025 -
DOC: Series and DataFrame.reindex accepts Timedelta as tolerance, which is not documented
#62010 opened
Jul 31, 2025 -
BUG: `assert_series_equal` broken with `check_category_order=False` for arrays with null values
#62008 opened
Jul 31, 2025 -
BUG: `IntervalArray.overlaps()` documents that it accepts another `IntervalArray`, but it is not implemented
#62004 opened
Jul 30, 2025 -
ENH: error messages for missing performance dependencies should point out the options
#62001 opened
Jul 30, 2025 -
DOC: documenting pandas.MultIndex.argsort
#61998 opened
Jul 29, 2025 -
DOC: Point out difference in usage of "str" dtype in constructor and astype member
#61992 opened
Jul 29, 2025
85 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
BUG: Fix Series.str.contains with compiled regex on Arrow string dtype (#61942)
#61946 commented on
Jul 30, 2025 • 10 new comments -
BUG: Fix Series.reindex losing values when reindexing to MultiIndex
#61969 commented on
Aug 1, 2025 • 7 new comments -
BUG: Fix TypeError in assert_index_equal when comparing CategoricalIndex with check_categorical=True and exact=False
#61941 commented on
Jul 29, 2025 • 6 new comments -
BUG FIX: Using Series.str.fullmatch() and Series.str.match() with a compiled regex fails with arrow strings
#61964 commented on
Jul 30, 2025 • 5 new comments -
BUG: IntervalIndex.unique() only contains the first interval if all interval borders are negative
#61920 commented on
Aug 1, 2025 • 4 new comments -
ENH: usecols takes input order for read_csv implementation review
#61967 commented on
Aug 3, 2025 • 3 new comments -
ENH: Implement PDEP-17
#61468 commented on
Aug 4, 2025 • 2 new comments -
BUG: Add min/max methods to ArrowExtensionArray GH#61311
#61924 commented on
Aug 1, 2025 • 2 new comments -
API: timestamp resolution inference - default to one unit (if possible) instead of being data-dependent?
#58989 commented on
Aug 3, 2025 • 0 new comments -
BUG/API: round-tripping non-nano datetime64s with to_json/read_json
#55827 commented on
Aug 3, 2025 • 0 new comments -
BUG: pd.Timestamp() defaults to [s] resolution instead of [ns]
#52653 commented on
Aug 3, 2025 • 0 new comments -
API: resolution for date_range, to_datetime, timedelta_range, to_timedelta
#49060 commented on
Aug 3, 2025 • 0 new comments -
BUG: Python 3.14 may not increment refcount
#61368 commented on
Aug 3, 2025 • 0 new comments -
ENH: Add a safe Option to hash_pandas_object with Default Value Set to True
#60428 commented on
Aug 3, 2025 • 0 new comments -
ENH: Add a name parameter to value_counts()
#55121 commented on
Aug 3, 2025 • 0 new comments -
BUG: Instantiating pandas._libs.parsers.TextReader segfaults
#53131 commented on
Aug 4, 2025 • 0 new comments -
BUG: fix .str.isdigit to honor unicode superscript for older pyarrow
#61962 commented on
Jul 29, 2025 • 0 new comments -
BUG: groupby.idxmin/idxmax will all NA values
#57745 commented on
Aug 2, 2025 • 0 new comments -
BUG: ValueError: Buffer dtype mismatch, expected 'intp_t' but got 'long long' on ARMv7 32 bit
#55305 commented on
Aug 2, 2025 • 0 new comments -
BUG: Implicit upcast to float64 for large series (more than 1000000 rows)
#61951 commented on
Aug 2, 2025 • 0 new comments -
Add axis argument to DataFrame.corr
#35002 commented on
Aug 2, 2025 • 0 new comments -
ENH: .interval accessor
#16401 commented on
Aug 2, 2025 • 0 new comments -
Groupby.mode() - feature request
#19254 commented on
Aug 2, 2025 • 0 new comments -
ENH: Add Support for GroupBy Numeric Operations
#20060 commented on
Aug 2, 2025 • 0 new comments -
ENH: Implementing NEP 18's __array_function__
#26380 commented on
Aug 2, 2025 • 0 new comments -
Idea: Make pd.Index.values not writable
#33001 commented on
Aug 2, 2025 • 0 new comments -
DOC: added button to edit on GitHub
#61956 commented on
Aug 4, 2025 • 0 new comments -
TST: run python-dev CI on 3.14-dev
#61950 commented on
Aug 3, 2025 • 0 new comments -
Updating error message to include more details about dataframe
#61929 commented on
Jul 30, 2025 • 0 new comments -
CoW: add readonly flag to ExtensionArrays, return read-only EA/ndarray in .array/EA.to_numpy()
#61925 commented on
Aug 3, 2025 • 0 new comments -
DOC: mention .attrs are preserved in Parquet IO for pyarrow engine
#61912 commented on
Jul 31, 2025 • 0 new comments -
DOC: Clarify broadcasting behavior when using lists in DataFrame arithmetic (GH18857)
#61820 commented on
Jul 30, 2025 • 0 new comments -
BUG: fillna with DataFrame input should preserve dtype when possible
#61742 commented on
Jul 31, 2025 • 0 new comments -
API: consistent NaN treatment for pyarrow dtypes
#61732 commented on
Aug 2, 2025 • 0 new comments -
CI: add PyPI Trusted-Publishing “publish” job to wheels workflow (#61669)
#61718 commented on
Jul 29, 2025 • 0 new comments -
BUG: Fix GroupBy aggregate coersion of outputs inconsistency for pyarrow dtypes
#61640 commented on
Aug 2, 2025 • 0 new comments -
ENH: Add `force_suffixes` boolean argument to `pd.merge`
#61498 commented on
Aug 2, 2025 • 0 new comments -
Implemented NumbaExecutionEngine
#61487 commented on
Jul 31, 2025 • 0 new comments -
ENH: Added `DataFrame.nsorted` to select top ``n`` rows according to column-dependent order
#61457 commented on
Jul 29, 2025 • 0 new comments -
ENH: support reading directory in read_csv
#61275 commented on
Aug 2, 2025 • 0 new comments -
BUG: Add fillna at the beginning of _where not to fill NA. #60729
#60772 commented on
Jul 30, 2025 • 0 new comments -
BUG: NameError: name 'pa' is not defined despite `pyarrow` is installed
#60573 commented on
Aug 4, 2025 • 0 new comments -
Decrypting file in read_csv
#44097 commented on
Aug 4, 2025 • 0 new comments -
BUG: CustomBusinessDay not respecting calendar
#60647 commented on
Jul 30, 2025 • 0 new comments -
BUG: Series.str.isdigit with pyarrow dtype doesn't honor unicode superscripts
#61466 commented on
Jul 30, 2025 • 0 new comments -
BUG: resampling with origin='end_day' raises ValueError: Values falls before first bin
#44957 commented on
Jul 30, 2025 • 0 new comments -
BUG: Operations not implemented for non-1D ExtensionArrays
#61866 commented on
Jul 30, 2025 • 0 new comments -
TRACKER: new default String dtype (pyarrow-backed, numpy NaN semantics)
#54792 commented on
Jul 29, 2025 • 0 new comments -
BUG: Using `/` operator with a `Path` and `Series` of string dtype fails
#61940 commented on
Jul 29, 2025 • 0 new comments -
BUG: Dataframe.aggregate drops pyarrow backend for lambda aggregation functions
#61812 commented on
Jul 29, 2025 • 0 new comments -
Thousands separator for to_csv
#30045 commented on
Jul 29, 2025 • 0 new comments -
DOC: Reindexing behaviour of dataframe column-assignment missing
#39845 commented on
Jul 29, 2025 • 0 new comments -
ENH: ExcelWriter append or create mode
#61634 commented on
Jul 29, 2025 • 0 new comments -
DOC: Standardize noncompliant docstrings in pandas/io/html.py (flake8-docstrings)
#61944 commented on
Jul 29, 2025 • 0 new comments -
BUG: Boolean Column Indexing Issue in Pandas
#61980 commented on
Jul 29, 2025 • 0 new comments -
BUILD: Installation issue on Mac with M1 Pro arm64 processor. pandas_parser.cpython-311-darwin.so is using x86_64 arch
#61566 commented on
Jul 29, 2025 • 0 new comments -
DOC: shift argument in `Series.shift()` is not used
#61955 commented on
Jul 29, 2025 • 0 new comments -
BUG: `Index` accepts lists as values
#61937 commented on
Jul 29, 2025 • 0 new comments -
BUG: pandas.Index takes multidimensional array as input
#20285 commented on
Jul 29, 2025 • 0 new comments -
DOC: Enforce Numpy Docstring Validation (Parent Issue)
#58063 commented on
Jul 28, 2025 • 0 new comments -
ENH: Allow storing timezone-aware datetimes in a series with a datetime64 dtype
#46998 commented on
Jul 28, 2025 • 0 new comments -
API: Breaking Changes in 3.0 (without deprecations)
#44823 commented on
Jul 28, 2025 • 0 new comments -
BUG: Column of dtype Categorical in DataFrame encounters error when taking a row that includes nan in the column
#58954 commented on
Jul 28, 2025 • 0 new comments -
RLS: 3.0
#57064 commented on
Jul 28, 2025 • 0 new comments -
QST: "Dummy" is rooted in ableist language
#35724 commented on
Aug 2, 2025 • 0 new comments -
ENH: Include df.attrs in to_csv output
#53577 commented on
Aug 2, 2025 • 0 new comments -
ENH: Disallow duplicate column names everywhere by default
#53217 commented on
Aug 2, 2025 • 0 new comments -
ENH: Include `df.attrs` in `to_json` output
#51012 commented on
Aug 2, 2025 • 0 new comments -
BUG: Read hdfs:/// is broken since 2.1.0
#58078 commented on
Aug 1, 2025 • 0 new comments -
'base' argument when resampling has no effect
#22855 commented on
Aug 1, 2025 • 0 new comments -
ENH: convert masked arrays for Series
#20427 commented on
Aug 1, 2025 • 0 new comments -
Read_csv leaks memory when used in multiple threads
#19941 commented on
Aug 1, 2025 • 0 new comments -
BUG: assert_frame_equal(check_dtype=False) fails when comparing two DFs containing pd.NA that only differ in dtype (object vs Int32)
#61473 commented on
Aug 1, 2025 • 0 new comments -
DOC: Docstring for BooleanDType inconsistent with the rest of repo
#61939 commented on
Aug 1, 2025 • 0 new comments -
BUG: 'Sphinx parallel build error' when building docs locally prevents index.html creation
#61975 commented on
Jul 31, 2025 • 0 new comments -
BUG: read_sql no longer works simply with SqlAlchemy selectables and a quick fix
#35484 commented on
Jul 31, 2025 • 0 new comments -
BUG: Merge fails on pyarrow datetime columns
#61926 commented on
Jul 31, 2025 • 0 new comments -
DOC: warn about apply with raw=True, if function returns Optional[int]
#61632 commented on
Jul 31, 2025 • 0 new comments -
BUG: series.groupby(level=0) and series.groupby(pd.Grouper(level=0)) produce different orders
#61943 commented on
Jul 30, 2025 • 0 new comments -
BUG: The lines skipped by pd.read_csv(path, chunksize=n, on_bad_lines='skip') depends on `n`.
#61973 commented on
Jul 30, 2025 • 0 new comments -
DOC: add button to edit on GitHub
#39859 commented on
Jul 30, 2025 • 0 new comments -
DOC: Enhance '10 minute to pandas' by providing video walkthrough to help auditory learners and improve accessibility.
#59359 commented on
Jul 30, 2025 • 0 new comments -
ENH: error message context.
#61834 commented on
Jul 30, 2025 • 0 new comments -
FEEDBACK: PyArrow as a required dependency and PyArrow backed strings
#54466 commented on
Jul 30, 2025 • 0 new comments -
BUG: Inconsistent .values NA/NaN
#61856 commented on
Jul 30, 2025 • 0 new comments