
Chardata first working on feature branch #6975

Open
pp-mo wants to merge 65 commits into SciTools:FEATURE_chardata from pp-mo:chardata_plus_encoded_datasets

Conversation

@pp-mo (Member) commented Mar 11, 2026

Successor to #6898
Now targeting the (new) FEATURE_chardata feature branch in the main repo

TODO: please check that any remaining unresolved issues on #6898 are now resolved here

pp-mo added 30 commits January 19, 2026 11:49
Get 'create_cf_data_variable' to call 'create_generic_cf_array_var': Mostly working?
Rename; add in parts of old investigation; add temporary notes.
@pp-mo pp-mo requested a review from ukmo-ccbunney March 11, 2026 01:35
@pp-mo pp-mo changed the title Chardata plus encoded datasets Chardata first working on feature branch Mar 11, 2026
@ukmo-ccbunney (Contributor) left a comment


Looking pretty good. 👍🏼
I've got a few comments, questions and suggestions.

I have not looked at the tests yet - just the main Iris code. I thought it was worth submitting the review at this point so you can see the comments. I'll take a look at the tests next.

Also - remind me - what are we doing in the case where data is stored as a netCDF string type - i.e. the variable length string type? At the moment that just loads in as an object array in numpy. Were we just leaving that as-is? We can't write that kind of datatype in Iris.

Edit: Discussed with @pp-mo and he reminded me that we never intended to handle the variable length string cases.
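To make the distinction above concrete: fixed-width netCDF "char" data is an array of single bytes with a trailing character dimension, whereas variable-length string data loads as an object array (and stays out of scope here). A minimal sketch of the char-decoding idea, in plain Python with hypothetical names (not the Iris or netCDF4 API), assuming UTF-8 and NUL padding:

```python
# Illustrative sketch only; decode_char_rows is a hypothetical helper, not
# part of Iris. NetCDF "char" data is a fixed-width array of single bytes;
# decoding it means joining the bytes along the last (character) dimension,
# dropping NUL padding, and applying a character encoding (assumed UTF-8).

def decode_char_rows(rows, encoding="utf-8"):
    """Join fixed-width single-byte cells into strings, dropping NUL padding."""
    return [b"".join(row).rstrip(b"\x00").decode(encoding) for row in rows]

# A (2, 4) "char" array, written out as nested lists of 1-byte cells.
char_array = [
    [b"a", b"b", b"c", b"\x00"],
    [b"x", b"y", b"\x00", b"\x00"],
]

decoded = decode_char_rows(char_array)  # ["abc", "xy"]
```

A variable-length string variable, by contrast, already arrives as Python `str` objects in an object-dtype array, so no byte-level decode applies.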


codecov bot commented Mar 13, 2026

Codecov Report

❌ Patch coverage is 92.92308% with 23 lines in your changes missing coverage. Please review.
✅ Project coverage is 90.18%. Comparing base (043b0bc) to head (e4242a1).

Files with missing lines                              | Patch % | Lines
...ib/iris/fileformats/netcdf/_bytecoding_datasets.py | 93.36%  | 9 Missing and 4 partials ⚠️
lib/iris/fileformats/netcdf/saver.py                  | 91.95%  | 5 Missing and 2 partials ⚠️
lib/iris/fileformats/netcdf/_thread_safe_nc.py        | 81.25%  | 3 Missing ⚠️
Additional details and impacted files
@@                 Coverage Diff                  @@
##           FEATURE_chardata    #6975      +/-   ##
====================================================
+ Coverage             90.11%   90.18%   +0.07%     
====================================================
  Files                    91       92       +1     
  Lines                 24912    25092     +180     
  Branches               4675     4689      +14     
====================================================
+ Hits                  22449    22629     +180     
- Misses                 1684     1688       +4     
+ Partials                779      775       -4     

☔ View full report in Codecov by Sentry.

@ukmo-ccbunney (Contributor) left a comment


Tests look sensible as far as I can tell, with the expectation that more coverage will be added as part of #6898.

    # Except if it already is one, since they forbid "re-wrapping".
    if not hasattr(self._dataset, "THREAD_SAFE_FLAG"):
-       self._dataset = _thread_safe_nc.DatasetWrapper.from_existing(
+       self._dataset = bytecoding_datasets.DatasetWrapper.from_existing(
@pp-mo (Member, Author) Mar 13, 2026


Oops.
I think this should possibly (also) be an 'EncodedDataset'.
I need to think about this one; I guess it depends on what kind of 'dataset-like' is passed in here.

Suggested change:
-       self._dataset = bytecoding_datasets.DatasetWrapper.from_existing(
+       self._dataset = bytecoding_datasets.EncodedDataset.from_existing(
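For readers outside the PR: the `hasattr` guard quoted above implements a "wrap once" rule, since the wrappers forbid re-wrapping. A self-contained sketch of that pattern, with illustrative class and attribute names standing in for the Iris internals:

```python
# Hypothetical sketch of the "no re-wrapping" guard; the class name,
# THREAD_SAFE_FLAG sentinel, and from_existing() are illustrative
# stand-ins for the Iris wrappers, not the real implementation.

class DatasetWrapper:
    # Class-level sentinel: any instance of a wrapper exposes this
    # attribute, which is what the hasattr() guard checks for.
    THREAD_SAFE_FLAG = True

    def __init__(self, inner):
        self._inner = inner

    @classmethod
    def from_existing(cls, dataset):
        # Re-wrapping is forbidden, so an existing wrapper is returned as-is.
        if hasattr(dataset, "THREAD_SAFE_FLAG"):
            return dataset
        return cls(dataset)
```

Under this scheme the open question in the comment above reduces to: which wrapper class should own the sentinel check for the particular "dataset-like" object being passed in.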

    # Create a data-writeable object that we can stream into, which
    # encapsulates the file to be opened + variable to be written.
-   write_wrapper = _thread_safe_nc.NetCDFWriteProxy(
+   write_wrapper = bytecoding_datasets.EncodedNetCDFWriteProxy(
@pp-mo (Member, Author)


Suggested change:
-   write_wrapper = bytecoding_datasets.EncodedNetCDFWriteProxy(
+   # Note: we do *not* support selectable string encoding for writes,
+   # so this never needs to be a _thread_safe_nc.NetCDFWriteProxy.
+   write_wrapper = bytecoding_datasets.EncodedNetCDFWriteProxy(
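The core job of an encoding write proxy like the one named above is to turn Python strings into fixed-width byte rows before they reach the char variable. A minimal, purely illustrative sketch of that encode-and-pad step (the class name, `write` method, and UTF-8 default are assumptions, not the `bytecoding_datasets` API, which streams into a real netCDF variable):

```python
# Hypothetical sketch of the encode-on-write idea: strings handed to the
# writer are encoded and padded to the fixed character-dimension width.
# This is NOT the EncodedNetCDFWriteProxy implementation.

class EncodedWriteProxy:
    """Encode strings to fixed-width byte rows before storage."""

    def __init__(self, width, encoding="utf-8"):
        self.width = width          # length of the char dimension
        self.encoding = encoding    # assumed selectable at construction
        self.rows = []              # stand-in for the target variable

    def write(self, strings):
        for s in strings:
            # Encode, truncate to the char-dimension width...
            data = s.encode(self.encoding)[: self.width]
            # ...and pad short values with NUL bytes.
            self.rows.append(data.ljust(self.width, b"\x00"))
```

This also motivates the suggested comment above: the encoding choice lives on the write proxy itself, so the plain `_thread_safe_nc.NetCDFWriteProxy` is never needed on the write path.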

Co-authored-by: Patrick Peglar <patrick.peglar@metoffice.gov.uk>

ukmo-ccbunney commented Mar 13, 2026

Nearly there, I think.
There is just your decision on this comment and a failing doctest/docs build to address.

Edit: The docs failures might be a transient error; they seem to be related to a link failure in the intersphinx links:

intersphinx inventory 'https://pandas.pydata.org/docs/objects.inv' not fetchable due to <class 'requests.exceptions.HTTPError'>: 522 Server Error: <none> for url: https://pandas.pydata.org/docs/objects.inv

https://pandas.pydata.org seems to be unresponsive at the time of writing.
