When creating a `Connection`, it's currently possible to configure some aspects of the pandas / Arrow conversion, such as `_use_arrow_native_complex_types` and `_use_arrow_native_timestamps`, but `_convert_arrow_table` uses a hardcoded type mapper that cannot be changed:
`databricks-sql-python/src/databricks/sql/client.py`, lines 1349 to 1362 in 0947b9a:

```python
dtype_mapping = {
    pyarrow.int8(): pandas.Int8Dtype(),
    pyarrow.int16(): pandas.Int16Dtype(),
    pyarrow.int32(): pandas.Int32Dtype(),
    pyarrow.int64(): pandas.Int64Dtype(),
    pyarrow.uint8(): pandas.UInt8Dtype(),
    pyarrow.uint16(): pandas.UInt16Dtype(),
    pyarrow.uint32(): pandas.UInt32Dtype(),
    pyarrow.uint64(): pandas.UInt64Dtype(),
    pyarrow.bool_(): pandas.BooleanDtype(),
    pyarrow.float32(): pandas.Float32Dtype(),
    pyarrow.float64(): pandas.Float64Dtype(),
    pyarrow.string(): pandas.StringDtype(),
}
```
Similarly, there's no way to pass other parameters to the `to_pandas` call here:

`databricks-sql-python/src/databricks/sql/client.py`, lines 1366 to 1370 in 0947b9a:

```python
df = table_renamed.to_pandas(
    types_mapper=dtype_mapping.get,
    date_as_object=True,
    timestamp_as_object=True,
)
```
Please consider extending the connection parameters to add more flexibility.
madhav-db