From e6bd0e43c14d24b07e92fb00a8d936acc1e46d51 Mon Sep 17 00:00:00 2001 From: Emilio Mayorga Date: Thu, 20 Jan 2022 18:31:40 -0800 Subject: [PATCH 01/23] docs: update RTD versions discussion (latest > dev) [skip ci] (#533) I'll self merge --- docs/source/contributing.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/docs/source/contributing.rst b/docs/source/contributing.rst index 8c72fe8ea..38998bff0 100644 --- a/docs/source/contributing.rst +++ b/docs/source/contributing.rst @@ -35,7 +35,8 @@ This diagram depicts the complete workflow we use in the source GitHub repositor rel --> main - ``doc patch``: Updates to the documentation that refer to the current ``echopype`` - release can be pushed out immediately to the `echopype documentation site `_ + release can be pushed out immediately to the + `echopype documentation site `_ by contibuting patches (PRs) to the ``stable`` branch. See `Documentation development`_ below for more details. - ``code patch``: Code development is carried out as patches (PRs) to the ``dev`` @@ -131,7 +132,6 @@ and `S3 object-storage `_ sources, the latter via `minio `_. `.ci_helpers/run-test.py `_ - will execute all tests. The entire test suite can be a bit slow, taking up to 40 minutes or more. If your changes impact only some of the subpackages (``convert``, ``calibrate``, ``preprocess``, etc), you can run ``run-test.py`` with only a subset of tests by passing @@ -142,7 +142,6 @@ as an argument a comma-separated list of the modules that have changed. For exam python .ci_helpers/run-test.py --local --pytest-args="-vv" echopype/calibrate/calibrate_ek.py,echopype/preprocess/noise_est.py will run only tests associated with the ``calibrate`` and ``preprocess`` subpackages. - For ``run-test.py`` usage information, use the ``-h`` argument: ``python .ci_helpers/run-test.py -h`` @@ -224,12 +223,13 @@ Documentation versions ``_ redirects to the documentation ``stable`` version, ``_, which is built from the ``stable`` branch on the ``echopype`` GitHub repository. In addition, the ``latest`` version -(``_) is built from the ``main`` branch, -while the hidden `dev` version (``_) is built -from the ``dev`` branch. Finally, each new echopype release is built as a new release version -on ReadTheDocs. Merging pull requests into any of these three branches or issuing a -new tagged release will automatically result in a new ReadTheDocs build for the +(``_) is built from the ``dev`` branch and +therefore it reflects the bleeding edge development code (which may occasionally break +the documenation build). Finally, each new echopype release is built as a new release version +on ReadTheDocs. Merging pull requests into ``stable`` or ``dev`` or issuing a new +tagged release will automatically result in a new ReadTheDocs build for the corresponding version. We also maintain a test version of the documentation at ``_ for viewing and debugging larger, more experimental changes, typically from a separate fork. +This version is used to test one-off, major breaking changes. From a5912e77b28a5a5e312e5988d2b2743e4830b0bd Mon Sep 17 00:00:00 2001 From: b-reyes Date: Tue, 1 Feb 2022 14:34:22 -0800 Subject: [PATCH 02/23] add a period --- docs/source/index.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/source/index.rst b/docs/source/index.rst index ce223ac67..4c64ddb95 100644 --- a/docs/source/index.rst +++ b/docs/source/index.rst @@ -1,4 +1,4 @@ -.. echopype documentation master file, created by +OA.. 
echopype documentation master file, created by sphinx-quickstart on Wed Feb 13 15:33:27 2019. You can adapt this file completely to your liking, but it should at least contain the root `toctree` directive. @@ -17,7 +17,7 @@ However, most of the new data remain under-utilized. echopype aims to address the root cause of this problem - the lack of interoperable data format and scalable analysis workflows that adapt well with increasing data volume - by providing open-source tools as entry points for -scientists to make discovery using these new data. +scientists to make discovery using these new data. . Documentation From 5aaa231f8fc0799780017589e25981ccedcf991b Mon Sep 17 00:00:00 2001 From: b-reyes Date: Wed, 2 Feb 2022 09:01:41 -0800 Subject: [PATCH 03/23] Change name of installation section and add an additional examples section --- docs/source/index.rst | 4 ++-- docs/source/installation.rst | 16 +++++++++++++++- docs/source/resources.rst | 1 + 3 files changed, 18 insertions(+), 3 deletions(-) diff --git a/docs/source/index.rst b/docs/source/index.rst index 4c64ddb95..ce223ac67 100644 --- a/docs/source/index.rst +++ b/docs/source/index.rst @@ -1,4 +1,4 @@ -OA.. echopype documentation master file, created by +.. echopype documentation master file, created by sphinx-quickstart on Wed Feb 13 15:33:27 2019. You can adapt this file completely to your liking, but it should at least contain the root `toctree` directive. @@ -17,7 +17,7 @@ However, most of the new data remain under-utilized. echopype aims to address the root cause of this problem - the lack of interoperable data format and scalable analysis workflows that adapt well with increasing data volume - by providing open-source tools as entry points for -scientists to make discovery using these new data. . +scientists to make discovery using these new data. Documentation diff --git a/docs/source/installation.rst b/docs/source/installation.rst index c1056acc7..996542804 100644 --- a/docs/source/installation.rst +++ b/docs/source/installation.rst @@ -1,5 +1,9 @@ +Installation and Examples +========================= + + Installation -============ +------------ Echopype is available and tested for Python>=3.7. The latest release can be installed from `PyPI `_: @@ -18,3 +22,13 @@ Previous releases are also available on PyPI and conda. For instructions on installing a development version of echopype, see the :doc:`contributing` page. + + +Examples +-------- + +Additional `Jupyter notebooks `_ +illustrating the workflow of Echopype are also made available to the public. These +examples include a quick tour of Echopype, a demonstration of how Echopype can be used +to explore ship echosounder data from a Pacific Hake survey, and using Echopype to +visualize the response of zooplankton to a solar eclipse. 
\ No newline at end of file diff --git a/docs/source/resources.rst b/docs/source/resources.rst index 8ef2ad7c2..02d677530 100644 --- a/docs/source/resources.rst +++ b/docs/source/resources.rst @@ -1,6 +1,7 @@ Other resources ================ + Software -------- From f9a74d254a357c3a58fd6c21338e82efb323ddd7 Mon Sep 17 00:00:00 2001 From: b-reyes Date: Wed, 2 Feb 2022 11:17:37 -0800 Subject: [PATCH 04/23] remove "to the public" --- docs/source/installation.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/source/installation.rst b/docs/source/installation.rst index 996542804..61b9e2c9b 100644 --- a/docs/source/installation.rst +++ b/docs/source/installation.rst @@ -28,7 +28,7 @@ Examples -------- Additional `Jupyter notebooks `_ -illustrating the workflow of Echopype are also made available to the public. These +illustrating the workflow of Echopype are also made available. These examples include a quick tour of Echopype, a demonstration of how Echopype can be used to explore ship echosounder data from a Pacific Hake survey, and using Echopype to visualize the response of zooplankton to a solar eclipse. \ No newline at end of file From e22c788768937a10808105ba011ed5ced83a0127 Mon Sep 17 00:00:00 2001 From: leewujung Date: Thu, 10 Feb 2022 19:56:31 -0800 Subject: [PATCH 05/23] add remnant of conflict resolution --- docs/source/contributing.rst | 17 ----------------- 1 file changed, 17 deletions(-) diff --git a/docs/source/contributing.rst b/docs/source/contributing.rst index a29a104d2..9934d2433 100644 --- a/docs/source/contributing.rst +++ b/docs/source/contributing.rst @@ -34,16 +34,10 @@ This diagram depicts the complete workflow we use in the source GitHub repositor dev --> |dev merge| rel rel --> main -<<<<<<< HEAD - ``doc patch``: Updates to the documentation that refer to the current ``echopype`` release can be pushed out immediately to the `echopype documentation site `_ by contibuting patches (PRs) to the ``stable`` branch. See `Documentation development`_ -======= -- ``doc patch``: Updates to the documentation that refer to the current ``echopype`` - release can be pushed out immediately to the `echopype documentation site `_ - by contibuting patches (PRs) to the ``stable`` branch. See `Documentation development`_ ->>>>>>> main below for more details. - ``code patch``: Code development is carried out as patches (PRs) to the ``dev`` branch; changes in the documentation corresponding to changes in the code can be @@ -226,7 +220,6 @@ and adding a new section that documents a previously undocumented feature. Documentation versions ~~~~~~~~~~~~~~~~~~~~~~ -<<<<<<< HEAD ``_ redirects to the documentation ``stable`` version, ``_, which is built from the ``stable`` branch on the ``echopype`` GitHub repository. In addition, the ``latest`` version @@ -235,16 +228,6 @@ therefore it reflects the bleeding edge development code (which may occasionally the documenation build). Finally, each new echopype release is built as a new release version on ReadTheDocs. Merging pull requests into ``stable`` or ``dev`` or issuing a new tagged release will automatically result in a new ReadTheDocs build for the -======= -``_ redirects to the documentation ``stable`` version, -``_, which is built from the ``stable`` branch -on the ``echopype`` GitHub repository. In addition, the ``latest`` version -(``_) is built from the ``main`` branch, -while the hidden `dev` version (``_) is built -from the ``dev`` branch. 
Finally, each new echopype release is built as a new release version -on ReadTheDocs. Merging pull requests into any of these three branches or issuing a -new tagged release will automatically result in a new ReadTheDocs build for the ->>>>>>> main corresponding version. We also maintain a test version of the documentation at ``_ From 03a84d4fbd4f4fc999866397fa407f61855de2c0 Mon Sep 17 00:00:00 2001 From: Emilio Mayorga Date: Sun, 13 Feb 2022 21:15:40 -0800 Subject: [PATCH 06/23] Add 0.5.6 to What's new (#564) * docs: update RTD versions discussion (latest > dev) [skip ci] * docs: Add 0.5.6 to What's new * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> --- docs/source/contributing.rst | 22 ++++++++-------- docs/source/installation.rst | 2 +- docs/source/whats-new.rst | 50 ++++++++++++++++++++++++++++++++++++ 3 files changed, 62 insertions(+), 12 deletions(-) diff --git a/docs/source/contributing.rst b/docs/source/contributing.rst index 9934d2433..8bb4b3a97 100644 --- a/docs/source/contributing.rst +++ b/docs/source/contributing.rst @@ -34,10 +34,10 @@ This diagram depicts the complete workflow we use in the source GitHub repositor dev --> |dev merge| rel rel --> main -- ``doc patch``: Updates to the documentation that refer to the current ``echopype`` - release can be pushed out immediately to the - `echopype documentation site `_ - by contibuting patches (PRs) to the ``stable`` branch. See `Documentation development`_ +- ``doc patch``: Updates to the documentation that refer to the current ``echopype`` + release can be pushed out immediately to the + `echopype documentation site `_ + by contibuting patches (PRs) to the ``stable`` branch. See `Documentation development`_ below for more details. - ``code patch``: Code development is carried out as patches (PRs) to the ``dev`` branch; changes in the documentation corresponding to changes in the code can be @@ -220,14 +220,14 @@ and adding a new section that documents a previously undocumented feature. Documentation versions ~~~~~~~~~~~~~~~~~~~~~~ -``_ redirects to the documentation ``stable`` version, -``_, which is built from the ``stable`` branch -on the ``echopype`` GitHub repository. In addition, the ``latest`` version -(``_) is built from the ``dev`` branch and +``_ redirects to the documentation ``stable`` version, +``_, which is built from the ``stable`` branch +on the ``echopype`` GitHub repository. In addition, the ``latest`` version +(``_) is built from the ``dev`` branch and therefore it reflects the bleeding edge development code (which may occasionally break -the documenation build). Finally, each new echopype release is built as a new release version -on ReadTheDocs. Merging pull requests into ``stable`` or ``dev`` or issuing a new -tagged release will automatically result in a new ReadTheDocs build for the +the documentation build). Finally, each new echopype release is built as a new release version +on ReadTheDocs. Merging pull requests into ``stable`` or ``dev`` or issuing a new +tagged release will automatically result in a new ReadTheDocs build for the corresponding version. 
We also maintain a test version of the documentation at ``_ diff --git a/docs/source/installation.rst b/docs/source/installation.rst index ee4cf1645..470e5ea3a 100644 --- a/docs/source/installation.rst +++ b/docs/source/installation.rst @@ -31,4 +31,4 @@ Additional `Jupyter notebooks `_ for the complete history. +v0.5.6 (2022 Feb 10) +-------------------- + +Overview +~~~~~~~~ + +This is a minor release that contains an experimental new feature and a number of enhancements, clean-up and bug fixes, which pave the way for the next major release. + +New feature +~~~~~~~~~~~ + +- (beta) Allow interpolating CTD data in calibration (#464) + + - Interpolation currently allowed along the ``ping_time`` dimension (the ``"stationary"`` case) and across ``latitude`` and ``longitude`` (the ``"mobile"`` case). + - This mechanism is enabled via a new ``EnvParams`` class at input of calibration functions. + +Enhancements +~~~~~~~~~~~~ + +- Make visualize module fully optional with ``matplotlib``, ``cmocean`` being optional dependency (#526, #559) +- Set range entries with no backscatter data to NaN in output of ``echodata.compute_range()`` (#547) and still allows quick visualization (#555) +- Add ``codespell`` GitHub action to ensure correct spellings of words (#557) +- Allow ``sonar_model="EA640"`` for ``open_raw`` (before it had to be "EK80") (#539) + +Bug fixes +~~~~~~~~~ + +- Allow using ``sonar_model="EA640"`` (#538, #539) +- Allow flexible and empty environment variables in EA640/EK80 files (#537) +- Docstring overhaul and fix bugs in ``utils.uwa`` (#525) + +Documentation +~~~~~~~~~~~~~ + +- Upgrade echopype docs to use jupyter book (#543) +- Change the RTD ``latest`` to point to the ``dev`` branch (#467) + +Testing +~~~~~~~ + +- Update convert tests to enable parallel testing (#556) +- Overhaul tests (#523, #498) + + - use ``pytest.fixture`` for testing + - add ES70/ES80/EA640 test files + - add new EK80 small test files with parameter combinations + - reduce size for a subset of large EK80 test data files + +- Add packaging testing for the ``dev`` branch (#554) + v0.5.5 (2021 Dec 10) -------------------- From 5b3167a42c12fe950393bd5f01fcb85a5e08b9b0 Mon Sep 17 00:00:00 2001 From: Emilio Mayorga Date: Wed, 13 Apr 2022 18:57:23 -0700 Subject: [PATCH 07/23] docs: Update broken netcdf url [skip ci] (#627) --- docs/source/why.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/source/why.rst b/docs/source/why.rst index 73b2d6260..cd0c6cdb0 100644 --- a/docs/source/why.rst +++ b/docs/source/why.rst @@ -21,7 +21,7 @@ other climate and oceanographic data sets, facilitating the integration of ocean sonar data in interdisciplinary oceanographic research. .. _netCDF: - https://www.unidata.ucar.edu/software/netcdf/docs/netcdf_introduction.html + https://www.unidata.ucar.edu/software/netcdf/ .. _xarray: http://xarray.pydata.org/ .. _dask: http://dask.pydata.org/ .. _pandas: https://pandas.pydata.org/ From 17271a74deb2fc105a59395b3993dcb7c922e5c1 Mon Sep 17 00:00:00 2001 From: Wu-Jung Lee Date: Tue, 19 Apr 2022 15:08:12 -0700 Subject: [PATCH 08/23] update creating conda dev env instructions (#633) --- docs/source/contributing.rst | 9 ++++++++- 1 file changed, 8 insertions(+), 1 deletion(-) diff --git a/docs/source/contributing.rst b/docs/source/contributing.rst index 8bb4b3a97..13dd0e3e8 100644 --- a/docs/source/contributing.rst +++ b/docs/source/contributing.rst @@ -77,11 +77,18 @@ Create a `conda `_ environment for echopype development .. 
code-block:: bash - conda create -c conda-forge -n echopype --yes python=3.9 --file requirements.txt --file requirements-dev.txt + # create conda environment using the supplied requirements files + # note the last one docs/requirements.txt is only required for building docs + conda create -c conda-forge -n echopype --yes python=3.9 --file requirements.txt --file requirements-dev.txt --file docs/requirements.txt + + # switch to the newly built environment conda activate echopype + # ipykernel is recommended, in order to use with JupyterLab and IPython # to aid with development. We recommend you install JupyterLab separately conda install -c conda-forge ipykernel + + # install echopype in editable mode (setuptools "develop mode") pip install -e . See the :doc:`installation` page to simply install the latest echopype release from conda or PyPI. From a874b9315e4e73c27efcbd78e1760b7af3d44bd0 Mon Sep 17 00:00:00 2001 From: Emilio Mayorga Date: Mon, 23 May 2022 15:03:38 -0700 Subject: [PATCH 09/23] Update contributors text [skip ci] (#703) * docs: update contributors text [skip ci] * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> --- README.md | 11 +++------ docs/source/_toc.yml | 2 +- docs/source/index.md | 42 +++++++++++++++++++++++++++++++ docs/source/index.rst | 57 ------------------------------------------- 4 files changed, 46 insertions(+), 66 deletions(-) create mode 100644 docs/source/index.md delete mode 100644 docs/source/index.rst diff --git a/README.md b/README.md index bf50746f2..5d1442a7f 100644 --- a/README.md +++ b/README.md @@ -67,19 +67,14 @@ Please report any bugs by [creating issues on GitHub](https://medium.com/nyc-pla Contributors ------------ -[Wu-Jung Lee](http://leewujung.github.io) (@leewujung) leads this project and together with -[Kavin Nguyen](https://github.com/ngkavin) (@ngkavin), [Landung "Don" Setiawan](https://github.com/lsetiawan) (@lsetiawan), and [Imran Majeed](https://github.com/imranmaj) (@imranmaj) are primary developers of this package. -[Emilio Mayorga](https://www.apl.washington.edu/people/profile.php?last_name=Mayorga&first_name=Emilio) (@emiliom) -and [Valentina Staneva](https://escience.washington.edu/people/valentina-staneva/) (@valentina-s) -are also part of the development team. +Wu-Jung Lee ([@leewujung](https://github.com/leewujung)) founded the echopype project in 2018. It is currently led by Wu-Jung Lee and Emilio Mayorga ([@emiliom](https://github.com/emiliom)), who are primary developers together with Brandon Reyes ([@b-reyes](https://github.com/b-reyes)), Landung "Don" Setiawan ([@lsetiawan](https://github.com/lsetiawan)), and previously Kavin Nguyen ([@ngkavin](https://github.com/ngkavin)) and Imran Majeed ([@imranmaj](https://github.com/imranmaj)). Valentina Staneva ([@valentina-s](https://github.com/valentina-s)) is also part of the development team. -Other contributors are listed in [echopype documentation](https://echopype.readthedocs.io). We thank Dave Billenness of ASL Environmental Sciences for providing the AZFP Matlab Toolbox as reference for our development of AZFP support in echopype. -We also thank [Rick Towler](https://github.com/rhtowler) (@rhtowler) -of the Alaska Fisheries Science Center +We also thank Rick Towler ([@rhtowler](https://github.com/rhtowler)) +of the NOAA Alaska Fisheries Science Center for providing low-level file parsing routines for Simrad EK60 and EK80 echosounders. 
diff --git a/docs/source/_toc.yml b/docs/source/_toc.yml index 328217844..535087490 100644 --- a/docs/source/_toc.yml +++ b/docs/source/_toc.yml @@ -2,7 +2,7 @@ # Learn more at https://jupyterbook.org/customize/toc.html format: jb-book -root: index.rst +root: index parts: - caption: Getting Started chapters: diff --git a/docs/source/index.md b/docs/source/index.md new file mode 100644 index 000000000..b58bea38f --- /dev/null +++ b/docs/source/index.md @@ -0,0 +1,42 @@ +# Welcome to echopype! + +**Echopype** is a package built to enable interoperability and scalability +in ocean sonar data processing. +These data are widely used for obtaining information about the distribution and +abundance of marine animals, such as fish and krill. +Our ability to collect large volumes of sonar data from a variety of +ocean platforms has grown significantly in the last decade. +However, most of the new data remain under-utilized. +echopype aims to address the root cause of this problem - the lack of +interoperable data format and scalable analysis workflows that adapt well +with increasing data volume - by providing open-source tools as entry points for +scientists to make discovery using these new data. + + +## Contributors + +Wu-Jung Lee ([@leewujung](https://github.com/leewujung)) founded the echopype project in 2018. It is currently led by Wu-Jung Lee and Emilio Mayorga ([@emiliom](https://github.com/emiliom)), who are primary developers together with Brandon Reyes ([@b-reyes](https://github.com/b-reyes)), Landung "Don" Setiawan ([@lsetiawan](https://github.com/lsetiawan)), and previously Kavin Nguyen ([@ngkavin](https://github.com/ngkavin)) and Imran Majeed ([@imranmaj](https://github.com/imranmaj)). Valentina Staneva ([@valentina-s](https://github.com/valentina-s)) is also part of the development team. + +Other contributors include: +Frederic Cyr ([@cyrf0006](https://github.com/cyrf0006)), +Paul Robinson ([@prarobinson](https://github.com/prarobinson)), +Sven Gastauer ([@SvenGastauer](https://github.com/SvenGastauer)), +Marian Peña ([@marianpena](https://github.com/marianpena)), +Mark Langhirt ([@bnwkeys](https://github.com/bnwkeys)), +Erin LaBrecque ([@erinann](https://github.com/erinann)), +Emma Ozanich ([@emma-ozanich](https://github.com/emma-ozanich)), +Aaron Marburg ([@amarburg](https://github.com/amarburg)). A complete list of direct contributors is on our [GitHub Contributors Page](https://github.com/OSOceanAcoustics/echopype/graphs/contributors). + +We thank Dave Billenness of ASL Environmental Sciences for +providing the AZFP Matlab Toolbox as reference for our +development of AZFP support in echopype. +We also thank Rick Towler ([@rhtowler](https://github.com/rhtowler)) +of the NOAA Alaska Fisheries Science Center +for providing low-level file parsing routines for +Simrad EK60 and EK80 echosounders. + + +## License + +Echopype is licensed under the open source +[Apache 2.0 license](https://opensource.org/licenses/Apache-2.0). diff --git a/docs/source/index.rst b/docs/source/index.rst deleted file mode 100644 index cdd5a5067..000000000 --- a/docs/source/index.rst +++ /dev/null @@ -1,57 +0,0 @@ -.. echopype documentation master file, created by - sphinx-quickstart on Wed Feb 13 15:33:27 2019. - You can adapt this file completely to your liking, but it should at least - contain the root `toctree` directive. - - -Welcome to echopype! -==================== - -**Echopype** is a package built to enable interoperability and scalability -in ocean sonar data processing. 
-These data are widely used for obtaining information about the distribution and -abundance of marine animals, such as fish and krill. -Our ability to collect large volumes of sonar data from a variety of -ocean platforms has grown significantly in the last decade. -However, most of the new data remain under-utilized. -echopype aims to address the root cause of this problem - the lack of -interoperable data format and scalable analysis workflows that adapt well -with increasing data volume - by providing open-source tools as entry points for -scientists to make discovery using these new data. - -Contributors ------------- - -`Wu-Jung Lee `_ (@leewujung) leads this project -and together with `Kavin Nguyen `_ (@ngkavin), -`Landung "Don" Setiawan `_ (@lsetiawan), -and `Imran Majeed `_ (@imranmaj) -are primary developers of this package. -`Emilio Mayorga `_ (@emiliom) -and `Valentina Staneva `_ (@valentina-s) -are also part of the development team. - -Other contributors include: -`Frederic Cyr `_ (@cyrf0006), -`Paul Robinson `_ (@prarobinson), -`Sven Gastauer `_ (@SvenGastauer), -`Marian Peña `_ (@marianpena), -`Mark Langhirt `_ (@bnwkeys), -`Erin LaBrecque `_ (@erinann), -`Emma Ozanich `_ (@emma-ozanich), -`Aaron Marburg `_ (@amarburg) - -We thank Dave Billenness of ASL Environmental Sciences for -providing the AZFP Matlab Toolbox as reference for our -development of AZFP support in echopype. -We also thank `Rick Towler `_ (@rhtowler) -of the Alaska Fisheries Science Center -for providing low-level file parsing routines for -Simrad EK60 and EK80 echosounders. - - -License -------- - -Echopype is licensed under the open source -`Apache 2.0 license `_. From 7ce7c5a8e5ba92de4ac8cdbe6293b8ab66a73e4a Mon Sep 17 00:00:00 2001 From: Emilio Mayorga Date: Mon, 23 May 2022 15:09:44 -0700 Subject: [PATCH 10/23] Update README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 5d1442a7f..c7d4fa31c 100644 --- a/README.md +++ b/README.md @@ -69,6 +69,7 @@ Contributors Wu-Jung Lee ([@leewujung](https://github.com/leewujung)) founded the echopype project in 2018. It is currently led by Wu-Jung Lee and Emilio Mayorga ([@emiliom](https://github.com/emiliom)), who are primary developers together with Brandon Reyes ([@b-reyes](https://github.com/b-reyes)), Landung "Don" Setiawan ([@lsetiawan](https://github.com/lsetiawan)), and previously Kavin Nguyen ([@ngkavin](https://github.com/ngkavin)) and Imran Majeed ([@imranmaj](https://github.com/imranmaj)). Valentina Staneva ([@valentina-s](https://github.com/valentina-s)) is also part of the development team. +Other contributors are listed in [echopype documentation](https://echopype.readthedocs.io). 
We thank Dave Billenness of ASL Environmental Sciences for providing the AZFP Matlab Toolbox as reference for our From 4eb435bd07e1d4789efe96402661eb2021a9d6d3 Mon Sep 17 00:00:00 2001 From: Wu-Jung Lee Date: Wed, 29 Jun 2022 07:28:31 -0700 Subject: [PATCH 11/23] Fix python version requirements in docs (#744) * update required python version to 3.8 due to xarray requirements * add sphinx-panels to docs/requirements --- docs/requirements.txt | 1 + docs/source/installation.rst | 2 +- 2 files changed, 2 insertions(+), 1 deletion(-) diff --git a/docs/requirements.txt b/docs/requirements.txt index 6e6afca68..a91018353 100644 --- a/docs/requirements.txt +++ b/docs/requirements.txt @@ -1,5 +1,6 @@ sphinx_rtd_theme sphinx-automodapi +sphinx-panels sphinxcontrib-mermaid jupyter-book numpydoc diff --git a/docs/source/installation.rst b/docs/source/installation.rst index 470e5ea3a..2726009d5 100644 --- a/docs/source/installation.rst +++ b/docs/source/installation.rst @@ -5,7 +5,7 @@ Installation and Examples Installation ------------ -Echopype is available and tested for Python>=3.7. The latest release +Echopype is available and tested for Python>=3.8. The latest release can be installed from `PyPI `_: .. code-block:: console From a3a259fbba421b65fba7e33f77415bacee305e4b Mon Sep 17 00:00:00 2001 From: Emilio Mayorga Date: Wed, 13 Jul 2022 00:56:26 -0700 Subject: [PATCH 12/23] Updates to "Contributing to echopype" doc page [skip ci] (#764) * Update contributing page, CI flags and dev installation instructions * docs: CI actions flags now via PR title; mamba note; quote .[plot] for multi platform support * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> --- docs/source/contributing.rst | 22 ++++++++++------------ 1 file changed, 10 insertions(+), 12 deletions(-) diff --git a/docs/source/contributing.rst b/docs/source/contributing.rst index 576e3be8b..aa4542ac5 100644 --- a/docs/source/contributing.rst +++ b/docs/source/contributing.rst @@ -91,7 +91,14 @@ Create a `conda `_ environment for echopype development # install echopype in editable mode (setuptools "develop mode") # plot is an extra set of requirements that can be used for plotting. # the command will install all the dependencies along with plotting dependencies. - pip install -e .[plot] + pip install -e ".[plot]" + +.. note:: + + Try using `mamba `_ instead of ``conda`` + if the ``conda create`` and ``conda install`` step fail or take too long. + ``Mamba`` is a drop-in replacement for conda environment creation and package + installation that is typically faster than conda. See the :doc:`installation` page to simply install the latest echopype release from conda or PyPI. @@ -102,13 +109,6 @@ Tests and test infrastructure Test data files ~~~~~~~~~~~~~~~ -.. attention:: - - Echopype previously used Git LFS for managing and accessing large test data files. - We have deprecated its use starting with echopype version 0.5.0. The files - in https://github.com/OSOceanAcoustics/echopype/tree/main/echopype/test_data - are also being deprecated. - Test echosounder data files are managed in a private Google Drive folder and made available via the `cormorack/http `_ Docker image on Docker hub; the image is rebuilt daily when new test data are added @@ -181,14 +181,12 @@ The entire test suite can be a bit slow, taking up to 40 minutes or more. 
To mitigate this, the CI default is to run tests only for subpackages that were modified in the PR; this is done via ``.ci_helpers/run-test.py`` (see the `Running the tests`_ section). To have the CI execute the -entire test suite, add the GitHub label ``Needs Complete Testing`` to the -PR before submitting it. - +entire test suite, add the string "[all tests ci]" to the PR title. Under special circumstances, when the submitted changes have a very limited scope (such as contributions to the documentation) or you know exactly what you're doing (you're a seasoned echopype contributor), the CI can be skipped. -This is done by including the string "[skip ci]" in your last commit's message. +This is done by adding the string "[skip ci]" to the PR title. Documentation development From 72909babcfbc567943072c2d7293fcf7665810c4 Mon Sep 17 00:00:00 2001 From: Wu-Jung Lee Date: Thu, 4 Aug 2022 16:39:23 -0400 Subject: [PATCH 13/23] Add function to interpolate location to calibrated dataset (#749) * add first prototype of add_location * add simple test * use test_path directly * add typing for echodata * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * move add_location to preprocess * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix init * move add_latlon to subpackage consolidate * fix test * Added test for missing and all-nan lon & lat variables. Added support for propagating fixed-location (mooring) lat-lon coordinate * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Emilio Mayorga --- echopype/__init__.py | 12 +++- echopype/consolidate/__init__.py | 3 + echopype/consolidate/api.py | 67 +++++++++++++++++++ .../tests/consolidate/test_consolidate.py | 20 ++++++ 4 files changed, 100 insertions(+), 2 deletions(-) create mode 100644 echopype/consolidate/__init__.py create mode 100644 echopype/consolidate/api.py create mode 100644 echopype/tests/consolidate/test_consolidate.py diff --git a/echopype/__init__.py b/echopype/__init__.py index ca1953a3b..03adaafe2 100644 --- a/echopype/__init__.py +++ b/echopype/__init__.py @@ -2,9 +2,17 @@ from _echopype_version import version as __version__ # noqa -from . import calibrate, preprocess, utils +from . 
import calibrate, consolidate, preprocess, utils from .convert.api import open_raw from .echodata.api import open_converted from .echodata.combine import combine_echodata -__all__ = ["open_raw", "open_converted", "combine_echodata", "calibrate", "preprocess", "utils"] +__all__ = [ + "open_raw", + "open_converted", + "combine_echodata", + "calibrate", + "consolidate", + "preprocess", + "utils", +] diff --git a/echopype/consolidate/__init__.py b/echopype/consolidate/__init__.py new file mode 100644 index 000000000..ec7ca9dec --- /dev/null +++ b/echopype/consolidate/__init__.py @@ -0,0 +1,3 @@ +from .api import add_location + +__all__ = ["add_location"] diff --git a/echopype/consolidate/api.py b/echopype/consolidate/api.py new file mode 100644 index 000000000..090d3a108 --- /dev/null +++ b/echopype/consolidate/api.py @@ -0,0 +1,67 @@ +import datetime +from typing import Optional + +import numpy as np +import xarray as xr + +from ..echodata import EchoData + + +def add_location(ds: xr.Dataset, echodata: EchoData = None, nmea_sentence: Optional[str] = None): + """ + Add geographical location (latitude/longitude) to the Sv dataset. + + This function interpolates the location from the Platform group in the original data file + based on the time when the latitude/longitude data are recorded and the time the acoustic + data are recorded (`ping_time`). + + Parameters + ---------- + ds : xr.Dataset + An Sv or MVBS dataset for which the geographical locations will be added to + echodata + An `EchoData` object holding the raw data + nmea_sentence + NMEA sentence to select a subset of location data (optional) + + Returns + ------- + The input dataset with the the location data added + """ + + def sel_interp(var): + # NMEA sentence selection + if nmea_sentence: + coord_var = echodata["Platform"][var][ + echodata["Platform"]["sentence_type"] == nmea_sentence + ] + else: + coord_var = echodata["Platform"][var] + + if len(coord_var) == 1: + # Propagate single, fixed-location coordinate + return xr.DataArray( + data=coord_var.values[0] * np.ones(len(ds["ping_time"]), dtype=np.float64), + dims=["ping_time"], + attrs=coord_var.attrs, + ) + else: + # Interpolation. time1 is always associated with location data + return coord_var.interp(time1=ds["ping_time"]) + + if "longitude" not in echodata["Platform"] or echodata["Platform"]["longitude"].isnull().all(): + raise ValueError("Coordinate variables not present or all nan") + + interp_ds = ds.copy() + interp_ds["latitude"] = sel_interp("latitude") + interp_ds["longitude"] = sel_interp("longitude") + # Most attributes are attached automatically via interpolation + # here we add the history + history = ( + f"{datetime.datetime.utcnow()} +00:00. " + "Interpolated or propagated from Platform latitude/longitude." 
# noqa + ) + interp_ds["latitude"] = interp_ds["latitude"].assign_attrs({"history": history}) + interp_ds["longitude"] = interp_ds["longitude"].assign_attrs({"history": history}) + + return interp_ds.drop_vars("time1") diff --git a/echopype/tests/consolidate/test_consolidate.py b/echopype/tests/consolidate/test_consolidate.py new file mode 100644 index 000000000..e1cfdbcf6 --- /dev/null +++ b/echopype/tests/consolidate/test_consolidate.py @@ -0,0 +1,20 @@ +import echopype as ep + + +def test_add_location(test_path): + ed = ep.open_raw( + test_path["EK60"] / "Winter2017-D20170115-T150122.raw", + sonar_model="EK60" + ) + ds = ep.calibrate.compute_Sv(ed) + + def _check_var(ds_test): + assert "latitude" in ds_test + assert "longitude" in ds_test + assert "time1" not in ds_test + + ds_all = ep.consolidate.add_location(ds=ds, echodata=ed) + _check_var(ds_all) + + ds_sel = ep.consolidate.add_location(ds=ds, echodata=ed, nmea_sentence="GGA") + _check_var(ds_sel) From ad509db895a80bf435969919fa56cece73972c68 Mon Sep 17 00:00:00 2001 From: b-reyes <53541061+b-reyes@users.noreply.github.com> Date: Fri, 5 Aug 2022 11:56:59 -0700 Subject: [PATCH 14/23] change long_name in ds_power for EK80 (#771) --- echopype/convert/set_groups_ek80.py | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/echopype/convert/set_groups_ek80.py b/echopype/convert/set_groups_ek80.py index 9b279072c..e8ce4f047 100644 --- a/echopype/convert/set_groups_ek80.py +++ b/echopype/convert/set_groups_ek80.py @@ -648,7 +648,7 @@ def _assemble_ds_power(self, ch): "backscatter_r": ( ["ping_time", "range_sample"], self.parser_obj.ping_data_dict["power"][ch], - {"long_name": "Backscattering power", "units": "dB"}, + {"long_name": "Backscatter power", "units": "dB"}, ), }, coords={ From a020d71dde5978c98d1f36c69e733d5047d6af9c Mon Sep 17 00:00:00 2001 From: Don Setiawan Date: Tue, 9 Aug 2022 12:10:19 -0700 Subject: [PATCH 15/23] Try pinning xarray to previous version (#775) --- .ci_helpers/py3.10.yaml | 2 +- .ci_helpers/py3.8.yaml | 2 +- .ci_helpers/py3.9.yaml | 2 +- requirements.txt | 2 +- 4 files changed, 4 insertions(+), 4 deletions(-) diff --git a/.ci_helpers/py3.10.yaml b/.ci_helpers/py3.10.yaml index 5b7e99d61..fd2585b91 100644 --- a/.ci_helpers/py3.10.yaml +++ b/.ci_helpers/py3.10.yaml @@ -8,7 +8,7 @@ dependencies: - pynmea2 - pytz - scipy - - xarray + - xarray==2022.3.0 - zarr - fsspec - s3fs==2022.5.0 diff --git a/.ci_helpers/py3.8.yaml b/.ci_helpers/py3.8.yaml index 2597b6a3c..3623c47d3 100644 --- a/.ci_helpers/py3.8.yaml +++ b/.ci_helpers/py3.8.yaml @@ -8,7 +8,7 @@ dependencies: - pynmea2 - pytz - scipy - - xarray + - xarray==2022.3.0 - zarr - fsspec - s3fs==2022.5.0 diff --git a/.ci_helpers/py3.9.yaml b/.ci_helpers/py3.9.yaml index 2a0090659..41ffe9f7d 100644 --- a/.ci_helpers/py3.9.yaml +++ b/.ci_helpers/py3.9.yaml @@ -8,7 +8,7 @@ dependencies: - pynmea2 - pytz - scipy - - xarray + - xarray==2022.3.0 - zarr - fsspec - s3fs==2022.5.0 diff --git a/requirements.txt b/requirements.txt index 2b4a3a6db..772214e73 100644 --- a/requirements.txt +++ b/requirements.txt @@ -5,7 +5,7 @@ numpy pynmea2 pytz scipy -xarray +xarray==2022.3.0 zarr fsspec s3fs From de13aeae82053872f8e210f2cc3d6b57b49c88fe Mon Sep 17 00:00:00 2001 From: Don Setiawan Date: Wed, 10 Aug 2022 14:36:36 -0700 Subject: [PATCH 16/23] Overhaul access pattern [all tests ci] (#762) * Update access patterns for convert api * Update access pattern for 'Environment' * Update access pattern for 'Platform' * Update access pattern for 'Provenance' * Update access 
pattern for 'Vendor_specific' * Update access pattern for 'Sonar/Beam_group2' * Update access pattern for 'Sonar/Beam_group1' and others * Remove getattrs * Update echopype/tests/echodata/test_echodata.py Co-authored-by: Don Setiawan * Update echopype/tests/echodata/test_echodata_combine.py Co-authored-by: Don Setiawan * Modify getitem to return None and remove try/except * Add spec comment * Update echopype/tests/echodata/test_echodata_combine.py Co-authored-by: Wu-Jung Lee Co-authored-by: Wu-Jung Lee --- echopype/calibrate/api.py | 4 +- echopype/calibrate/calibrate_azfp.py | 22 ++- echopype/calibrate/calibrate_ek.py | 154 +++++++++++------- echopype/calibrate/env_params.py | 16 +- echopype/convert/api.py | 31 ++-- echopype/echodata/combine.py | 37 +++-- echopype/echodata/echodata.py | 76 ++++----- echopype/tests/calibrate/test_calibrate.py | 6 +- echopype/tests/convert/test_convert_azfp.py | 20 +-- echopype/tests/convert/test_convert_ek60.py | 12 +- echopype/tests/convert/test_convert_ek80.py | 34 ++-- .../test_convert_source_target_locs.py | 58 +++---- echopype/tests/echodata/test_echodata.py | 93 +++++------ .../tests/echodata/test_echodata_combine.py | 66 +++++--- echopype/tests/preprocess/test_preprocess.py | 2 +- echopype/tests/visualize/test_plot.py | 18 +- echopype/visualize/api.py | 4 +- 17 files changed, 336 insertions(+), 317 deletions(-) diff --git a/echopype/calibrate/api.py b/echopype/calibrate/api.py index 4136e8134..6c5805ad7 100644 --- a/echopype/calibrate/api.py +++ b/echopype/calibrate/api.py @@ -104,9 +104,9 @@ def add_attrs(cal_type, ds): prov_dict["processing_function"] = f"calibrate.compute_{cal_type}" cal_ds = cal_ds.assign_attrs(prov_dict) - if "water_level" in echodata.platform.data_vars.keys(): + if "water_level" in echodata["Platform"].data_vars.keys(): # add water_level to the created xr.Dataset - cal_ds["water_level"] = echodata.platform.water_level + cal_ds["water_level"] = echodata["Platform"].water_level return cal_ds diff --git a/echopype/calibrate/calibrate_azfp.py b/echopype/calibrate/calibrate_azfp.py index ca01beb35..b61c79c7f 100644 --- a/echopype/calibrate/calibrate_azfp.py +++ b/echopype/calibrate/calibrate_azfp.py @@ -34,13 +34,15 @@ def get_cal_params(self, cal_params): self.cal_params["equivalent_beam_angle"] = ( cal_params["equivalent_beam_angle"] if "equivalent_beam_angle" in cal_params - else self.echodata.beam["equivalent_beam_angle"] + else self.echodata["Sonar/Beam_group1"]["equivalent_beam_angle"] ) # Get params from the Vendor_specific group for p in ["EL", "DS", "TVR", "VTX", "Sv_offset"]: # substitute if None in user input - self.cal_params[p] = cal_params[p] if p in cal_params else self.echodata.vendor[p] + self.cal_params[p] = ( + cal_params[p] if p in cal_params else self.echodata["Vendor_specific"][p] + ) def get_env_params(self): """Get env params using user inputs or values from data file. 
@@ -53,7 +55,7 @@ def get_env_params(self): self.env_params["temperature"] = ( self.env_params["temperature"] if "temperature" in self.env_params - else self.echodata.environment["temperature"] + else self.echodata["Environment"]["temperature"] ) # Salinity and pressure always come from user input @@ -71,7 +73,7 @@ def get_env_params(self): formula_source="AZFP", ) self.env_params["sound_absorption"] = uwa.calc_absorption( - frequency=self.echodata.beam["frequency_nominal"], + frequency=self.echodata["Sonar/Beam_group1"]["frequency_nominal"], temperature=self.env_params["temperature"], salinity=self.env_params["salinity"], pressure=self.env_params["pressure"], @@ -108,10 +110,10 @@ def _cal_power(self, cal_type, **kwargs): # Compute derived params # Harmonize time coordinate between Beam_groupX data and env_params - # Use self.echodata.beam because complex sample is always in Beam_group1 + # Use self.echodata["Sonar/Beam_group1"] because complex sample is always in Beam_group1 for p in self.env_params.keys(): self.env_params[p] = self.echodata._harmonize_env_param_time( - self.env_params[p], ping_time=self.echodata.beam.ping_time + self.env_params[p], ping_time=self.echodata["Sonar/Beam_group1"].ping_time ) # TODO: take care of dividing by zero encountered in log10 @@ -122,7 +124,9 @@ def _cal_power(self, cal_type, **kwargs): # scaling factor (slope) in Fig.G-1, units Volts/dB], see p.84 a = self.cal_params["DS"] EL = ( - self.cal_params["EL"] - 2.5 / a + self.echodata.beam.backscatter_r / (26214 * a) + self.cal_params["EL"] + - 2.5 / a + + self.echodata["Sonar/Beam_group1"].backscatter_r / (26214 * a) ) # eq.(5) # has beam dim due to backscatter_r if cal_type == "Sv": @@ -136,7 +140,7 @@ def _cal_power(self, cal_type, **kwargs): * np.log10( 0.5 * self.env_params["sound_speed"] - * self.echodata.beam["transmit_duration_nominal"] + * self.echodata["Sonar/Beam_group1"]["transmit_duration_nominal"] * self.cal_params["equivalent_beam_angle"] ) + self.cal_params["Sv_offset"] @@ -155,7 +159,7 @@ def _cal_power(self, cal_type, **kwargs): out = out.merge(self.range_meter) # Add frequency_nominal to data set - out["frequency_nominal"] = self.echodata.beam["frequency_nominal"] + out["frequency_nominal"] = self.echodata["Sonar/Beam_group1"]["frequency_nominal"] # Add env and cal parameters out = self._add_params_to_output(out) diff --git a/echopype/calibrate/calibrate_ek.py b/echopype/calibrate/calibrate_ek.py index a08b7fe04..42eacf3ba 100644 --- a/echopype/calibrate/calibrate_ek.py +++ b/echopype/calibrate/calibrate_ek.py @@ -54,7 +54,7 @@ def _get_vend_cal_params_power(self, param, waveform_mode): param : str {"sa_correction", "gain_correction"} name of parameter to retrieve """ - ds_vend = self.echodata.vendor + ds_vend = self.echodata["Vendor_specific"] if ds_vend is None or param not in ds_vend: return None @@ -62,10 +62,10 @@ def _get_vend_cal_params_power(self, param, waveform_mode): if param not in ["sa_correction", "gain_correction"]: raise ValueError(f"Unknown parameter {param}") - if waveform_mode == "CW" and self.echodata.beam_power is not None: - beam = self.echodata.beam_power + if waveform_mode == "CW" and self.echodata["Sonar/Beam_group2"] is not None: + beam = self.echodata["Sonar/Beam_group2"] else: - beam = self.echodata.beam + beam = self.echodata["Sonar/Beam_group1"] # indexes of frequencies that are for power, not complex relevant_indexes = np.where( @@ -105,11 +105,11 @@ def get_cal_params(self, cal_params, waveform_mode, encode_mode): if ( encode_mode == "power" and 
waveform_mode == "CW" - and self.echodata.beam_power is not None + and self.echodata["Sonar/Beam_group2"] is not None ): - beam = self.echodata.beam_power + beam = self.echodata["Sonar/Beam_group2"] else: - beam = self.echodata.beam + beam = self.echodata["Sonar/Beam_group1"] # Params from the Vendor_specific group @@ -141,8 +141,9 @@ def _cal_power(self, cal_type, use_beam_power=False) -> xr.Dataset: 'TS' for calculating target strength use_beam_power : bool whether to use beam_power. - If ``True`` use ``echodata.beam_power``; if ``False`` use ``echodata.beam``. - Note ``echodata.beam_power`` could only exist for EK80 data. + If ``True`` use ``echodata["Sonar/Beam_group2"]``; + if ``False`` use ``echodata["Sonar/Beam_group1"]``. + Note ``echodata["Sonar/Beam_group2"]`` could only exist for EK80 data. Returns ------- @@ -151,9 +152,9 @@ def _cal_power(self, cal_type, use_beam_power=False) -> xr.Dataset: """ # Select source of backscatter data if use_beam_power: - beam = self.echodata.beam_power + beam = self.echodata["Sonar/Beam_group2"] else: - beam = self.echodata.beam + beam = self.echodata["Sonar/Beam_group1"] # Harmonize time coordinate between Beam_groupX data and env_params for p in self.env_params.keys(): @@ -261,7 +262,7 @@ def get_env_params(self, **kwargs): pressure=self.env_params["pressure"], ) self.env_params["sound_absorption"] = uwa.calc_absorption( - frequency=self.echodata.beam["frequency_nominal"], + frequency=self.echodata["Sonar/Beam_group1"]["frequency_nominal"], temperature=self.env_params["temperature"], salinity=self.env_params["salinity"], pressure=self.env_params["pressure"], @@ -271,12 +272,12 @@ def get_env_params(self, **kwargs): self.env_params["sound_speed"] = ( self.env_params["sound_speed"] if "sound_speed" in self.env_params - else self.echodata.environment["sound_speed_indicative"] + else self.echodata["Environment"]["sound_speed_indicative"] ) self.env_params["sound_absorption"] = ( self.env_params["sound_absorption"] if "sound_absorption" in self.env_params - else self.echodata.environment["absorption_indicative"] + else self.echodata["Environment"]["absorption_indicative"] ) def compute_Sv(self, **kwargs): @@ -340,11 +341,11 @@ def get_env_params(self, waveform_mode=None, encode_mode="complex"): if ( encode_mode == "power" and waveform_mode == "CW" - and self.echodata.beam_power is not None + and self.echodata["Sonar/Beam_group2"] is not None ): - beam = self.echodata.beam_power + beam = self.echodata["Sonar/Beam_group2"] else: - beam = self.echodata.beam + beam = self.echodata["Sonar/Beam_group1"] # Use center frequency if in BB mode, else use nominal channel frequency if waveform_mode == "BB": @@ -380,12 +381,14 @@ def get_env_params(self, waveform_mode=None, encode_mode="complex"): ["temperature", "salinity", "depth"], ): self.env_params[p1] = ( - self.env_params[p1] if p1 in self.env_params else self.echodata.environment[p2] + self.env_params[p1] + if p1 in self.env_params + else self.echodata["Environment"][p2] ) self.env_params["sound_speed"] = ( self.env_params["sound_speed"] if "sound_speed" in self.env_params - else self.echodata.environment["sound_speed_indicative"] + else self.echodata["Environment"]["sound_speed_indicative"] ) self.env_params["sound_absorption"] = ( self.env_params["sound_absorption"] @@ -411,16 +414,18 @@ def _get_vend_cal_params_complex(self, channel_id, filter_name, param_type): 'coeff' or 'decimation' """ if param_type == "coeff": - v = self.echodata.vendor.attrs[ + v = self.echodata["Vendor_specific"].attrs[ "%s %s 
filter_r" % (channel_id, filter_name) ] + 1j * np.array( - self.echodata.vendor.attrs["%s %s filter_i" % (channel_id, filter_name)] + self.echodata["Vendor_specific"].attrs["%s %s filter_i" % (channel_id, filter_name)] ) if v.size == 1: v = np.expand_dims(v, axis=0) # expand dims for convolution return v else: - return self.echodata.vendor.attrs["%s %s decimation" % (channel_id, filter_name)] + return self.echodata["Vendor_specific"].attrs[ + "%s %s decimation" % (channel_id, filter_name) + ] def _tapered_chirp( self, @@ -513,15 +518,15 @@ def get_transmit_chirp(self, waveform_mode): """ # Make sure it is BB mode data if waveform_mode == "BB" and ( - ("frequency_start" not in self.echodata.beam) - or ("frequency_end" not in self.echodata.beam) + ("frequency_start" not in self.echodata["Sonar/Beam_group1"]) + or ("frequency_end" not in self.echodata["Sonar/Beam_group1"]) ): raise TypeError("File does not contain BB mode complex samples!") y_all = {} y_time_all = {} tau_effective = {} - for chan in self.echodata.beam.channel.values: + for chan in self.echodata["Sonar/Beam_group1"].channel.values: # TODO: currently only deal with the case with # a fixed tx key param values within a channel if waveform_mode == "BB": @@ -541,13 +546,15 @@ def get_transmit_chirp(self, waveform_mode): ] tx_params = {} for p in tx_param_names: - tx_params[p] = np.unique(self.echodata.beam[p].sel(channel=chan)) + tx_params[p] = np.unique(self.echodata["Sonar/Beam_group1"][p].sel(channel=chan)) if tx_params[p].size != 1: raise TypeError("File contains changing %s!" % p) y_tmp, _ = self._tapered_chirp(**tx_params) # Filter and decimate chirp template - fs_deci = 1 / self.echodata.beam.sel(channel=chan)["sample_interval"].values + fs_deci = ( + 1 / self.echodata["Sonar/Beam_group1"].sel(channel=chan)["sample_interval"].values + ) y_tmp, y_tmp_time = self._filter_decimate_chirp(y_tmp, chan) # Compute effective pulse length @@ -570,9 +577,9 @@ def compress_pulse(self, chirp, chan_BB=None): channels that transmit in BB mode (since CW mode can be in mixed in complex samples too) """ - backscatter = self.echodata.beam["backscatter_r"].sel( + backscatter = self.echodata["Sonar/Beam_group1"]["backscatter_r"].sel( channel=chan_BB - ) + 1j * self.echodata.beam["backscatter_i"].sel(channel=chan_BB) + ) + 1j * self.echodata["Sonar/Beam_group1"]["backscatter_i"].sel(channel=chan_BB) pc_all = [] for chan in chan_BB: @@ -622,26 +629,28 @@ def _get_gain_for_complex(self, waveform_mode, chan_sel) -> xr.DataArray: "gain_correction", waveform_mode=waveform_mode ) gain = [] - if "gain" in self.echodata.vendor.data_vars: + if "gain" in self.echodata["Vendor_specific"].data_vars: # index using channel_id as order of frequency across channel can be arbitrary # reference to freq_center in case some channels are CW complex samples # (already dropped when computing freq_center in the calling function) for ch_id in chan_sel: # if channel gain exists in data - if ch_id in self.echodata.vendor.cal_channel_id: - gain_vec = self.echodata.vendor.gain.sel(cal_channel_id=ch_id) + if ch_id in self.echodata["Vendor_specific"].cal_channel_id: + gain_vec = self.echodata["Vendor_specific"].gain.sel(cal_channel_id=ch_id) gain_temp = ( gain_vec.interp( - cal_frequency=self.echodata.vendor.frequency_nominal.sel( - channel=ch_id - ) + cal_frequency=self.echodata[ + "Vendor_specific" + ].frequency_nominal.sel(channel=ch_id) ).drop(["cal_channel_id", "cal_frequency"]) ).expand_dims("channel") # if no freq-dependent gain use CW gain else: gain_temp = ( 
gain_single.sel(channel=ch_id) - .reindex_like(self.echodata.beam.backscatter_r, method="nearest") + .reindex_like( + self.echodata["Sonar/Beam_group1"].backscatter_r, method="nearest" + ) .expand_dims("channel") ) gain_temp.name = "gain" @@ -682,9 +691,13 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: # use center frequency for each ping to select BB or CW channels # when all samples are encoded as complex samples - if "frequency_start" in self.echodata.beam and "frequency_end" in self.echodata.beam: + if ( + "frequency_start" in self.echodata["Sonar/Beam_group1"] + and "frequency_end" in self.echodata["Sonar/Beam_group1"] + ): freq_center = ( - self.echodata.beam["frequency_start"] + self.echodata.beam["frequency_end"] + self.echodata["Sonar/Beam_group1"]["frequency_start"] + + self.echodata["Sonar/Beam_group1"]["frequency_end"] ) / 2 # has beam dim else: freq_center = None @@ -702,7 +715,7 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: # backscatter data pc = self.compress_pulse(chirp, chan_BB=chan_sel) # has beam dim prx = ( - self.echodata.beam.beam.size + self.echodata["Sonar/Beam_group1"].beam.size * np.abs(pc.mean(dim="beam")) ** 2 / (2 * np.sqrt(2)) ** 2 * (np.abs(self.z_er + self.z_et) / self.z_er) ** 2 @@ -711,7 +724,7 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: else: if freq_center is None: # when only have CW complex samples - chan_sel = self.echodata.beam.channel + chan_sel = self.echodata["Sonar/Beam_group1"].channel else: # if BB and CW complex samples co-exist # drop those that contain BB samples (not nan in freq start/end) @@ -719,10 +732,11 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: # backscatter data backscatter_cw = ( - self.echodata.beam["backscatter_r"] + 1j * self.echodata.beam["backscatter_i"] + self.echodata["Sonar/Beam_group1"]["backscatter_r"] + + 1j * self.echodata["Sonar/Beam_group1"]["backscatter_i"] ) prx = ( - self.echodata.beam.beam.size + self.echodata["Sonar/Beam_group1"].beam.size * np.abs(backscatter_cw.mean(dim="beam")) ** 2 / (2 * np.sqrt(2)) ** 2 * (np.abs(self.z_er + self.z_et) / self.z_er) ** 2 @@ -734,10 +748,10 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: # Compute derived params # Harmonize time coordinate between Beam_groupX data and env_params - # Use self.echodata.beam because complex sample is always in Beam_group1 + # Use self.echodata["Sonar/Beam_group1"] because complex sample is always in Beam_group1 for p in self.env_params.keys(): self.env_params[p] = self.echodata._harmonize_env_param_time( - self.env_params[p], ping_time=self.echodata.beam.ping_time + self.env_params[p], ping_time=self.echodata["Sonar/Beam_group1"].ping_time ) sound_speed = self.env_params["sound_speed"] @@ -745,14 +759,18 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: range_meter = self.range_meter.sel(channel=chan_sel) if waveform_mode == "BB": # use true center frequency for BB pulse - wavelength = sound_speed / self.echodata.beam.frequency_nominal.sel(channel=chan_sel) + wavelength = sound_speed / self.echodata["Sonar/Beam_group1"].frequency_nominal.sel( + channel=chan_sel + ) # use true center frequency to interpolate for gain factor gain = self._get_gain_for_complex(waveform_mode=waveform_mode, chan_sel=chan_sel) else: # use nominal channel frequency for CW pulse - wavelength = sound_speed / self.echodata.beam.frequency_nominal.sel(channel=chan_sel) + wavelength = sound_speed / self.echodata["Sonar/Beam_group1"].frequency_nominal.sel( + 
channel=chan_sel + ) # use nominal channel frequency to select gain factor gain = self._get_gain_for_complex(waveform_mode=waveform_mode, chan_sel=chan_sel) @@ -767,21 +785,29 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: # effective pulse length tau_effective = xr.DataArray( data=list(tau_effective.values()), - coords=[self.echodata.beam.channel, self.echodata.beam.ping_time], + coords=[ + self.echodata["Sonar/Beam_group1"].channel, + self.echodata["Sonar/Beam_group1"].ping_time, + ], dims=["channel", "ping_time"], ).sel(channel=chan_sel) # other params - transmit_power = self.echodata.beam["transmit_power"].sel(channel=chan_sel) + transmit_power = self.echodata["Sonar/Beam_group1"]["transmit_power"].sel( + channel=chan_sel + ) # equivalent_beam_angle has beam dim if waveform_mode == "BB": - psifc = self.echodata.beam["equivalent_beam_angle"].sel( + psifc = self.echodata["Sonar/Beam_group1"]["equivalent_beam_angle"].sel( channel=chan_sel ) + 10 * np.log10( - self.echodata.vendor.frequency_nominal.sel(channel=chan_sel) / freq_center + self.echodata["Vendor_specific"].frequency_nominal.sel(channel=chan_sel) + / freq_center ) elif waveform_mode == "CW": - psifc = self.echodata.beam["equivalent_beam_angle"].sel(channel=chan_sel) + psifc = self.echodata["Sonar/Beam_group1"]["equivalent_beam_angle"].sel( + channel=chan_sel + ) out = ( 10 * np.log10(prx) @@ -795,7 +821,9 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: out = out.rename_vars({list(out.data_vars.keys())[0]: "Sv"}) elif cal_type == "TS": - transmit_power = self.echodata.beam["transmit_power"].sel(channel=chan_sel) + transmit_power = self.echodata["Sonar/Beam_group1"]["transmit_power"].sel( + channel=chan_sel + ) out = ( 10 * np.log10(prx) @@ -810,7 +838,7 @@ def _cal_complex(self, cal_type, waveform_mode) -> xr.Dataset: out = out.merge(range_meter) # Add frequency_nominal to data set - out["frequency_nominal"] = self.echodata.beam["frequency_nominal"] + out["frequency_nominal"] = self.echodata["Sonar/Beam_group1"]["frequency_nominal"] # Add env and cal parameters out = self._add_params_to_output(out) @@ -879,22 +907,22 @@ def _compute_cal(self, cal_type, waveform_mode, encode_mode) -> xr.Dataset: # Raise error when waveform_mode and actual recording mode do not match # This simple check is only possible for BB-only data, # since for data with both BB and CW complex samples, - # frequency_start will exist in echodata.beam for the BB channels - if waveform_mode == "BB" and "frequency_start" not in self.echodata.beam: + # frequency_start will exist in echodata["Sonar/Beam_group1"] for the BB channels + if waveform_mode == "BB" and "frequency_start" not in self.echodata["Sonar/Beam_group1"]: raise ValueError("waveform_mode='BB' but broadband data not found!") # Set use_beam_power - # - True: use self.echodata.beam_power for cal - # - False: use self.echodata.beam for cal + # - True: use self.echodata["Sonar/Beam_group2"] for cal + # - False: use self.echodata["Sonar/Beam_group1"] for cal use_beam_power = False # Warn user about additional data in the raw file if another type exists # When both power and complex samples exist: - # complex samples will be stored in echodata.beam - # power samples will be stored in echodata.beam_power + # complex samples will be stored in echodata["Sonar/Beam_group1"] + # power samples will be stored in echodata["Sonar/Beam_group2"] # When only one type of samples exist, - # all samples with be stored in echodata.beam - if self.echodata.beam_power is not None: # both 
power and complex samples exist + # all samples with be stored in echodata["Sonar/Beam_group1"] + if self.echodata["Sonar/Beam_group2"] is not None: # both power and complex samples exist # If both beam and beam_power groups exist, # this means that CW data are encoded as power samples and in beam_power group if waveform_mode == "CW" and encode_mode == "complex": @@ -910,7 +938,9 @@ def _compute_cal(self, cal_type, waveform_mode, encode_mode) -> xr.Dataset: "Only complex samples are calibrated, but power samples also exist in the raw data file!" # noqa ) else: # only power OR complex samples exist - if "backscatter_i" in self.echodata.beam.variables: # data contain only complex samples + if ( + "backscatter_i" in self.echodata["Sonar/Beam_group1"].variables + ): # data contain only complex samples if encode_mode == "power": raise TypeError( "File does not contain power samples! Use encode_mode='complex'" diff --git a/echopype/calibrate/env_params.py b/echopype/calibrate/env_params.py index 5856ccc76..ba6f2d588 100644 --- a/echopype/calibrate/env_params.py +++ b/echopype/calibrate/env_params.py @@ -95,7 +95,7 @@ def _apply(self, echodata) -> Dict[str, xr.DataArray]: raise ValueError("invalid data_kind") for dim in dims: - if dim not in echodata.platform: + if dim not in echodata["Platform"]: raise ValueError( f"could not interpolate env_params; EchoData is missing dimension {dim}" ) @@ -103,10 +103,12 @@ def _apply(self, echodata) -> Dict[str, xr.DataArray]: env_params = self.env_params if self.data_kind == "mobile": - if np.isnan(echodata.platform["time1"]).all(): + if np.isnan(echodata["Platform"]["time1"]).all(): raise ValueError("cannot perform mobile interpolation without time1") # compute_range needs indexing by ping_time - interp_plat = echodata.platform.interp({"time1": echodata.beam["ping_time"]}) + interp_plat = echodata["Platform"].interp( + {"time1": echodata["Sonar/Beam_group1"]["ping_time"]} + ) result = {} for var, values in env_params.data_vars.items(): @@ -134,7 +136,7 @@ def _apply(self, echodata) -> Dict[str, xr.DataArray]: } extrap = env_params.interp( - {dim: echodata.platform[dim].data for dim in dims}, + {dim: echodata["Platform"][dim].data for dim in dims}, method=self.extrap_method, # scipy interp uses "extrapolate" but scipy interpn uses None kwargs={"fill_value": "extrapolate" if len(dims) == 1 else None}, @@ -143,7 +145,7 @@ def _apply(self, echodata) -> Dict[str, xr.DataArray]: extrap_unique_idx = {dim: np.unique(extrap[dim], return_index=True)[1] for dim in dims} extrap = extrap.isel(**extrap_unique_idx) interp = env_params.interp( - {dim: echodata.platform[dim].data for dim in dims}, + {dim: echodata["Platform"][dim].data for dim in dims}, method=self.interp_method, ) interp_unique_idx = {dim: np.unique(interp[dim], return_index=True)[1] for dim in dims} @@ -179,8 +181,8 @@ def _apply(self, echodata) -> Dict[str, xr.DataArray]: # if self.data_kind == "organized": # # get platform latitude and longitude indexed by ping_time - # interp_plat = echodata.platform.interp( - # {"time": echodata.platform["ping_time"]} + # interp_plat = echodata["Platform"].interp( + # {"time": echodata["Platform"]["ping_time"]} # ) # # get env_params latitude and longitude indexed by ping_time # env_params = env_params.interp( diff --git a/echopype/convert/api.py b/echopype/convert/api.py index c9d9fe9be..3e04e31e5 100644 --- a/echopype/convert/api.py +++ b/echopype/convert/api.py @@ -105,11 +105,11 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): # TODO: in 
terms of chunking, would using rechunker at the end be faster and more convenient? # Top-level group - io.save_file(echodata.top, path=output_path, mode="w", engine=engine) + io.save_file(echodata["Top-level"], path=output_path, mode="w", engine=engine) # Provenance group io.save_file( - echodata.provenance, + echodata["Provenance"], path=output_path, group="Provenance", mode="a", @@ -117,9 +117,9 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): ) # Environment group - if "time1" in echodata.environment: + if "time1" in echodata["Environment"]: io.save_file( - echodata.environment.chunk( + echodata["Environment"].chunk( {"time1": DEFAULT_CHUNK_SIZE["ping_time"]} ), # TODO: chunking necessary? path=output_path, @@ -129,7 +129,7 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): ) else: io.save_file( - echodata.environment, + echodata["Environment"], path=output_path, mode="a", engine=engine, @@ -138,7 +138,7 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): # Sonar group io.save_file( - echodata.sonar, + echodata["Sonar"], path=output_path, group="Sonar", mode="a", @@ -162,7 +162,7 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): ) else: io.save_file( - echodata.beam.chunk( + echodata[f"Sonar/{BEAM_SUBGROUP_DEFAULT}"].chunk( { "range_sample": DEFAULT_CHUNK_SIZE["range_sample"], "ping_time": DEFAULT_CHUNK_SIZE["ping_time"], @@ -174,9 +174,10 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): group=f"Sonar/{BEAM_SUBGROUP_DEFAULT}", compression_settings=COMPRESSION_SETTINGS[engine] if compress else None, ) - if echodata.beam_power is not None: + if echodata["Sonar/Beam_group2"] is not None: + # some sonar model does not produce Sonar/Beam_group2 io.save_file( - echodata.beam_power.chunk( + echodata["Sonar/Beam_group2"].chunk( { "range_sample": DEFAULT_CHUNK_SIZE["range_sample"], "ping_time": DEFAULT_CHUNK_SIZE["ping_time"], @@ -191,7 +192,7 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): # Platform group io.save_file( - echodata.platform, # TODO: chunking necessary? time1 and time2 (EK80) only + echodata["Platform"], # TODO: chunking necessary? time1 and time2 (EK80) only path=output_path, mode="a", engine=engine, @@ -200,9 +201,9 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): ) # Platform/NMEA group: some sonar model does not produce NMEA data - if echodata.nmea is not None: + if echodata["Platform/NMEA"] is not None: io.save_file( - echodata.nmea, # TODO: chunking necessary? + echodata["Platform/NMEA"], # TODO: chunking necessary? path=output_path, mode="a", engine=engine, @@ -211,9 +212,9 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): ) # Vendor_specific group - if "ping_time" in echodata.vendor: + if "ping_time" in echodata["Vendor_specific"]: io.save_file( - echodata.vendor.chunk( + echodata["Vendor_specific"].chunk( {"ping_time": DEFAULT_CHUNK_SIZE["ping_time"]} ), # TODO: chunking necessary? path=output_path, @@ -224,7 +225,7 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): ) else: io.save_file( - echodata.vendor, # TODO: chunking necessary? + echodata["Vendor_specific"], # TODO: chunking necessary? 
path=output_path, mode="a", engine=engine, diff --git a/echopype/echodata/combine.py b/echopype/echodata/combine.py index a9b679ddb..244a9da5f 100644 --- a/echopype/echodata/combine.py +++ b/echopype/echodata/combine.py @@ -176,14 +176,20 @@ def combine_echodata(echodatas: List[EchoData], combine_attrs="override") -> Ech # { group1: [echodata1 attrs, echodata2 attrs, ...], ... } old_attrs: Dict[str, List[Dict[str, Any]]] = dict() + # Specification for Echodata.group_map can be found in + # echopype/echodata/convention/1.0.yml for group, value in EchoData.group_map.items(): - group_datasets = [ - getattr(echodata, group) - for echodata in echodatas - if getattr(echodata, group) is not None - ] + group_datasets = [] + group_path = value["ep_group"] + if group_path is None: + group_path = "Top-level" + + for echodata in echodatas: + if echodata[group_path] is not None: + group_datasets.append(echodata[group_path]) + if group in ("top", "sonar"): - combined_group = getattr(echodatas[0], group) + combined_group = echodatas[0][group_path] elif group == "provenance": combined_group = assemble_combined_provenance( [ @@ -195,7 +201,6 @@ def combine_echodata(echodatas: List[EchoData], combine_attrs="override") -> Ech ) else: if len(group_datasets) == 0: - setattr(result, group, None) continue concat_dim = SONAR_MODELS[sonar_model]["concat_dims"].get( @@ -265,20 +270,20 @@ def combine_echodata(echodatas: List[EchoData], combine_attrs="override") -> Ech # save ping time before reversal correction if old_ping_time is not None: - result.provenance["old_ping_time"] = old_ping_time - result.provenance.attrs["reversed_ping_times"] = 1 + result["Provenance"]["old_ping_time"] = old_ping_time + result["Provenance"].attrs["reversed_ping_times"] = 1 # save location time before reversal correction if old_time1 is not None: - result.provenance["old_time1"] = old_time1 - result.provenance.attrs["reversed_ping_times"] = 1 + result["Provenance"]["old_time1"] = old_time1 + result["Provenance"].attrs["reversed_ping_times"] = 1 # save mru time before reversal correction if old_time2 is not None: - result.provenance["old_time2"] = old_time2 - result.provenance.attrs["reversed_ping_times"] = 1 + result["Provenance"]["old_time2"] = old_time2 + result["Provenance"].attrs["reversed_ping_times"] = 1 # save time3 before reversal correction if old_time3 is not None: - result.provenance["old_time3"] = old_time3 - result.provenance.attrs["reversed_ping_times"] = 1 + result["Provenance"]["old_time3"] = old_time3 + result["Provenance"].attrs["reversed_ping_times"] = 1 # TODO: possible parameter to disable original attributes and original ping_time storage # in provenance group? 
# save attrs from before combination @@ -311,7 +316,7 @@ def combine_echodata(echodatas: List[EchoData], combine_attrs="override") -> Ech }, dims=["echodata_filename", f"{group}_attr_key"], ) - result.provenance = result.provenance.assign({f"{group}_attrs": attrs}) + result["Provenance"] = result["Provenance"].assign({f"{group}_attrs": attrs}) # Add back sonar model result.sonar_model = sonar_model diff --git a/echopype/echodata/echodata.py b/echopype/echodata/echodata.py index 07f84526b..ce07064c6 100644 --- a/echopype/echodata/echodata.py +++ b/echopype/echodata/echodata.py @@ -204,7 +204,7 @@ def __getitem__(self, __key: Optional[str]) -> Optional[xr.Dataset]: node = self.__get_node(__key) return self.__get_dataset(node) except KeyError: - raise GroupNotFoundError(__key) + return None else: raise ValueError("Datatree not found!") @@ -219,30 +219,6 @@ def __setitem__(self, __key: Optional[str], __newvalue: Any) -> Optional[xr.Data else: raise ValueError("Datatree not found!") - # NOTE: Temporary for now until the attribute access pattern is deprecated - def __getattribute__(self, __name: str) -> Any: - attr_value = super().__getattribute__(__name) - group_map = sonarnetcdf_1.yaml_dict["groups"] - if __name in group_map: - group = group_map.get(__name) - group_path = group["ep_group"] - if __name == "top": - group_path = "Top-level" - msg_list = ["This access pattern will be deprecated in future releases."] - if attr_value is not None: - msg_list.append(f"Access the group directly by doing echodata['{group_path}']") - if self._tree: - if group_path == "Top-level": - node = self._tree - else: - node = self._tree[group_path] - attr_value = self.__get_dataset(node) - else: - msg_list.append(f"No group path exists for '{self.__class__.__name__}.{__name}'") - msg = " ".join(msg_list) - warnings.warn(message=msg, category=DeprecationWarning, stacklevel=2) - return attr_value - def __setattr__(self, __name: str, __value: Any) -> None: attr_value = __value if isinstance(__value, DataTree) and __name != "_tree": @@ -368,7 +344,7 @@ def compute_range( - When `sonar_model` is `"AZFP"` and `env_params` does not contain either `"sound_speed"` or all of `"temperature"`, `"salinity"`, and `"pressure"`. - When `sonar_model` is `"EK60"` or `"EK80"`, - EchoData.environment.sound_speed_indicative does not exist, + EchoData["Environment"].sound_speed_indicative does not exist, and `env_params` does not contain either `"sound_speed"` or all of `"temperature"`, `"salinity"`, and `"pressure"`. - When `sonar_model` is not `"AZFP"`, `"EK60"`, or `"EK80"`. 
@@ -395,8 +371,10 @@ def compute_range( if "sound_speed" in env_params: sound_speed = env_params["sound_speed"] - elif self.sonar_model in ("EK60", "EK80") and "sound_speed_indicative" in self.environment: - sound_speed = self.environment["sound_speed_indicative"] + elif ( + self.sonar_model in ("EK60", "EK80") and "sound_speed_indicative" in self["Environment"] + ): + sound_speed = self["Environment"]["sound_speed_indicative"] elif all([param in env_params for param in ("temperature", "salinity", "pressure")]): sound_speed = calc_sound_speed( env_params["temperature"], @@ -409,7 +387,8 @@ def compute_range( "sound speed must be specified in env_params, " "with temperature, salinity, and pressure all specified in env_params " "for sound speed to be calculated, " - "or in EchoData.environment.sound_speed_indicative for EK60 and EK80 sonar models" + "or in EchoData['Environment'].sound_speed_indicative " + "for EK60 and EK80 sonar models" ) # AZFP @@ -419,9 +398,9 @@ def compute_range( raise ValueError("azfp_cal_type must be specified when sonar_model is AZFP") # Notation below follows p.86 of user manual - N = self.vendor["number_of_samples_per_average_bin"] # samples per bin - f = self.vendor["digitization_rate"] # digitization rate - L = self.vendor["lockout_index"] # number of lockout samples + N = self["Vendor_specific"]["number_of_samples_per_average_bin"] # samples per bin + f = self["Vendor_specific"]["digitization_rate"] # digitization rate + L = self["Vendor_specific"]["lockout_index"] # number of lockout samples # keep this in ref of AZFP matlab code, # set to 1 since we want to calculate from raw data @@ -430,7 +409,7 @@ def compute_range( # Harmonize sound_speed time1 and Beam_group1 ping_time sound_speed = self._harmonize_env_param_time( p=sound_speed, - ping_time=self.beam.ping_time, + ping_time=self["Sonar/Beam_group1"].ping_time, ) # Calculate range using parameters for each freq @@ -440,14 +419,15 @@ def compute_range( range_offset = 0 else: range_offset = ( - sound_speed * self.beam["transmit_duration_nominal"] / 4 + sound_speed * self["Sonar/Beam_group1"]["transmit_duration_nominal"] / 4 ) # from matlab code range_meter = ( sound_speed * L / (2 * f) + (sound_speed / 4) * ( - ((2 * (self.beam.range_sample + 1) - 1) * N * bins_to_avg - 1) / f - + self.beam["transmit_duration_nominal"] + ((2 * (self["Sonar/Beam_group1"].range_sample + 1) - 1) * N * bins_to_avg - 1) + / f + + self["Sonar/Beam_group1"]["transmit_duration_nominal"] ) - range_offset ) @@ -481,13 +461,13 @@ def compute_range( if ( self.sonar_model == "EK80" and encode_mode == "power" - and self.beam_power is not None + and self["Sonar/Beam_group2"] is not None ): # if both CW and BB exist and beam_power group is not empty # this means that CW is recorded in power/angle mode - beam = self.beam_power + beam = self["Sonar/Beam_group2"] else: - beam = self.beam + beam = self["Sonar/Beam_group1"] # Harmonize sound_speed time1 and Beam_groupX ping_time sound_speed = self._harmonize_env_param_time( @@ -501,7 +481,7 @@ def compute_range( beam.range_sample - tvg_correction_factor ) * sample_thickness # [frequency x range_sample] elif waveform_mode == "BB": - beam = self.beam # always use the Beam group + beam = self["Sonar/Beam_group1"] # always use the Beam group # TODO: bug: right now only first ping_time has non-nan range shift = beam["transmit_duration_nominal"] # based on Lar Anderson's Matlab code @@ -555,7 +535,7 @@ def update_platform( extra_platform_data_file_name=None, ): """ - Updates the `EchoData.platform` 
group with additional external platform data. + Updates the `EchoData["Platform"]` group with additional external platform data. `extra_platform_data` must be an xarray Dataset. The name of the time dimension in `extra_platform_data` is specified by the @@ -575,7 +555,7 @@ def update_platform( ---------- extra_platform_data : xr.Dataset An `xr.Dataset` containing the additional platform data to be added - to the `EchoData.platform` group. + to the `EchoData["Platform"]` group. time_dim: str, default="time" The name of the time dimension in `extra_platform_data`; used for extracting data from `extra_platform_data`. @@ -611,8 +591,8 @@ def update_platform( extra_platform_data = extra_platform_data.drop_vars(trajectory_var) extra_platform_data = extra_platform_data.swap_dims({"obs": time_dim}) - # clip incoming time to 1 less than min of EchoData.beam["ping_time"] and - # 1 greater than max of EchoData.beam["ping_time"] + # clip incoming time to 1 less than min of EchoData["Sonar/Beam_group1"]["ping_time"] and + # 1 greater than max of EchoData["Sonar/Beam_group1"]["ping_time"] # account for unsorted external time by checking whether each time value is between # min and max ping_time instead of finding the 2 external times corresponding to the # min and max ping_time and taking all the times between those indices @@ -621,7 +601,7 @@ def update_platform( # fmt: off min_index = max( np.searchsorted( - sorted_external_time, self.beam["ping_time"].min(), side="left" + sorted_external_time, self["Sonar/Beam_group1"]["ping_time"].min(), side="left" ) - 1, 0, ) @@ -629,7 +609,7 @@ def update_platform( max_index = min( np.searchsorted( sorted_external_time, - self.beam["ping_time"].max(), + self["Sonar/Beam_group1"]["ping_time"].max(), side="right", ), len(sorted_external_time) - 1, @@ -643,7 +623,7 @@ def update_platform( } ) - platform = self.platform + platform = self["Platform"] platform = platform.drop_dims(["time1"], errors="ignore") # drop_dims is also dropping latitude, longitude and sentence_type why? 
platform = platform.assign_coords(time1=extra_platform_data[time_dim].values) @@ -707,7 +687,7 @@ def mapping_search_variable(mapping, keys, default=None): var_attrs["history"] = history_attr platform[var] = platform[var].assign_attrs(**var_attrs) - self.platform = set_encodings(platform) + self["Platform"] = set_encodings(platform) @classmethod def _load_convert(cls, convert_obj): diff --git a/echopype/tests/calibrate/test_calibrate.py b/echopype/tests/calibrate/test_calibrate.py index 4cab3c77f..db3fd3b87 100644 --- a/echopype/tests/calibrate/test_calibrate.py +++ b/echopype/tests/calibrate/test_calibrate.py @@ -37,7 +37,7 @@ def test_compute_Sv_returns_water_level(ek60_path): # make sure the returned Dataset has water_level and throw an assertion error if the # EchoData object does not have water_level (just in case we remove it from the file # used in the future) - assert 'water_level' in ed.platform.data_vars.keys() + assert 'water_level' in ed["Platform"].data_vars.keys() assert 'water_level' in ds_Sv.data_vars @@ -147,7 +147,7 @@ def test_compute_Sv_azfp(azfp_path): # Calibrate using identical env params as in Matlab ParametersAZFP.m # AZFP Matlab code uses average temperature avg_temperature = ( - echodata.environment['temperature'].mean('time1').values + echodata["Environment"]['temperature'].mean('time1').values ) env_params = { 'temperature': avg_temperature, @@ -236,7 +236,7 @@ def test_compute_Sv_ek80_pc_echoview(ek80_path): ) # compute range [m] chirp, _, tau_effective = cal_obj.get_transmit_chirp(waveform_mode="BB") freq_center = ( - echodata.beam["frequency_start"] + echodata.beam["frequency_end"] + echodata["Sonar/Beam_group1"]["frequency_start"] + echodata["Sonar/Beam_group1"]["frequency_end"] ).dropna( dim="channel" ) / 2 # drop those that contain CW samples (nan in freq start/end) diff --git a/echopype/tests/convert/test_convert_azfp.py b/echopype/tests/convert/test_convert_azfp.py index 4e52bdbea..ddad4da3b 100644 --- a/echopype/tests/convert/test_convert_azfp.py +++ b/echopype/tests/convert/test_convert_azfp.py @@ -62,40 +62,40 @@ def test_convert_azfp_01a_matlab_raw(azfp_path): # frequency assert np.array_equal( ds_matlab['Data']['Freq'][0][0].squeeze(), - echodata.beam.frequency_nominal / 1000, + echodata["Sonar/Beam_group1"].frequency_nominal / 1000, ) # matlab file in kHz # backscatter count assert np.array_equal( np.array( [ds_matlab_output['Output'][0]['N'][fidx] for fidx in range(4)] ), - echodata.beam.backscatter_r.isel(beam=0).drop('beam').values, + echodata["Sonar/Beam_group1"].backscatter_r.isel(beam=0).drop('beam').values, ) # Test vendor group # Test temperature assert np.array_equal( np.array([d[4] for d in ds_matlab['Data']['Ancillary'][0]]).squeeze(), - echodata.vendor.ancillary.isel(ancillary_len=4).values, + echodata["Vendor_specific"].ancillary.isel(ancillary_len=4).values, ) assert np.array_equal( np.array([d[0] for d in ds_matlab['Data']['BatteryTx'][0]]).squeeze(), - echodata.vendor.battery_tx, + echodata["Vendor_specific"].battery_tx, ) assert np.array_equal( np.array( [d[0] for d in ds_matlab['Data']['BatteryMain'][0]] ).squeeze(), - echodata.vendor.battery_main, + echodata["Vendor_specific"].battery_main, ) # tilt x-y assert np.array_equal( np.array([d[0] for d in ds_matlab['Data']['Ancillary'][0]]).squeeze(), - echodata.vendor.tilt_x_count, + echodata["Vendor_specific"].tilt_x_count, ) assert np.array_equal( np.array([d[1] for d in ds_matlab['Data']['Ancillary'][0]]).squeeze(), - echodata.vendor.tilt_y_count, + 
echodata["Vendor_specific"].tilt_y_count, ) # check convention-required variables in the Platform group @@ -137,7 +137,7 @@ def test_convert_azfp_01a_raw_echoview(azfp_path): echodata = open_raw( raw_file=azfp_01a_path, sonar_model='AZFP', xml_path=azfp_xml_path ) - assert np.array_equal(test_power, echodata.beam.backscatter_r.isel(beam=0).drop('beam')) + assert np.array_equal(test_power, echodata["Sonar/Beam_group1"].backscatter_r.isel(beam=0).drop('beam')) # check convention-required variables in the Platform group check_platform_required_vars(echodata) @@ -152,10 +152,10 @@ def test_convert_azfp_01a_different_ranges(azfp_path): echodata = open_raw( raw_file=azfp_01a_path, sonar_model='AZFP', xml_path=azfp_xml_path ) - assert echodata.beam.backscatter_r.sel(channel='55030-125-1').dropna( + assert echodata["Sonar/Beam_group1"].backscatter_r.sel(channel='55030-125-1').dropna( 'range_sample' ).shape == (360, 438, 1) - assert echodata.beam.backscatter_r.sel(channel='55030-769-4').dropna( + assert echodata["Sonar/Beam_group1"].backscatter_r.sel(channel='55030-769-4').dropna( 'range_sample' ).shape == (360, 135, 1) diff --git a/echopype/tests/convert/test_convert_ek60.py b/echopype/tests/convert/test_convert_ek60.py index 75d0c7ecf..59de58201 100644 --- a/echopype/tests/convert/test_convert_ek60.py +++ b/echopype/tests/convert/test_convert_ek60.py @@ -69,7 +69,7 @@ def test_convert_ek60_matlab_raw(ek60_path): ds_matlab['rawData'][0]['pings'][0]['power'][0][fidx] for fidx in range(5) ], - echodata.beam.backscatter_r.isel(beam=0).transpose( + echodata["Sonar/Beam_group1"].backscatter_r.isel(beam=0).transpose( 'channel', 'range_sample', 'ping_time' ), rtol=0, @@ -82,7 +82,7 @@ def test_convert_ek60_matlab_raw(ek60_path): ds_matlab['rawData'][0]['pings'][0][angle][0][fidx] for fidx in range(5) ], - echodata.beam['angle_' + angle].isel(beam=0).transpose( + echodata["Sonar/Beam_group1"]['angle_' + angle].isel(beam=0).transpose( 'channel', 'range_sample', 'ping_time' ), ) @@ -113,12 +113,12 @@ def test_convert_ek60_echoview_raw(ek60_path): # get indices of sorted frequency_nominal values. This is necessary # because the frequency_nominal values are not always in ascending order. 
- sorted_freq_ind = np.argsort(echodata.beam.frequency_nominal) + sorted_freq_ind = np.argsort(echodata["Sonar/Beam_group1"].frequency_nominal) for fidx, atol in zip(range(5), [1e-5, 1.1e-5, 1.1e-5, 1e-5, 1e-5]): assert np.allclose( test_power[fidx, :, :], - echodata.beam.backscatter_r.isel( + echodata["Sonar/Beam_group1"].backscatter_r.isel( channel=sorted_freq_ind[fidx], ping_time=slice(None, 10), range_sample=slice(1, None), @@ -166,8 +166,8 @@ def test_convert_ek60_duplicate_ping_times(ek60_path): ) ed = open_raw(raw_path, "EK60") - assert "duplicate_ping_times" in ed.provenance.attrs - assert "old_ping_time" in ed.provenance + assert "duplicate_ping_times" in ed["Provenance"].attrs + assert "old_ping_time" in ed["Provenance"] def test_convert_ek60_duplicate_frequencies(ek60_path): diff --git a/echopype/tests/convert/test_convert_ek80.py b/echopype/tests/convert/test_convert_ek80.py index e6fef9e85..fbba98328 100644 --- a/echopype/tests/convert/test_convert_ek80.py +++ b/echopype/tests/convert/test_convert_ek80.py @@ -105,7 +105,7 @@ def test_convert_ek80_complex_matlab(ek80_path): # Test complex parsed data ds_matlab = loadmat(ek80_matlab_path_bb) assert np.array_equal( - echodata.beam.backscatter_r.sel(channel='WBT 549762-15 ES70-7C', + echodata["Sonar/Beam_group1"].backscatter_r.sel(channel='WBT 549762-15 ES70-7C', ping_time='2017-09-12T23:49:10.722999808') .dropna('range_sample') .values[1:, :], @@ -114,7 +114,7 @@ def test_convert_ek80_complex_matlab(ek80_path): ), # real part ) assert np.array_equal( - echodata.beam.backscatter_i.sel(channel='WBT 549762-15 ES70-7C', + echodata["Sonar/Beam_group1"].backscatter_i.sel(channel='WBT 549762-15 ES70-7C', ping_time='2017-09-12T23:49:10.722999808') .dropna('range_sample') .values[1:, :], @@ -173,22 +173,22 @@ def test_convert_ek80_cw_power_angle_echoview(ek80_path): # get indices of sorted frequency_nominal values. This is necessary # because the frequency_nominal values are not always in ascending order. - sorted_freq_ind = np.argsort(echodata.beam.frequency_nominal) + sorted_freq_ind = np.argsort(echodata["Sonar/Beam_group1"].frequency_nominal) # get sorted channel list based on frequency_nominal values - channel_list = echodata.beam.channel[sorted_freq_ind.values] + channel_list = echodata["Sonar/Beam_group1"].channel[sorted_freq_ind.values] # check water_level assert (echodata["Platform"]["water_level"] == 0).all() # Test power # single point error in original raw data. 
Read as -2000 by echopype and -999 by EchoView - echodata.beam.backscatter_r[sorted_freq_ind.values[3], 4, 13174] = -999 + echodata["Sonar/Beam_group1"].backscatter_r[sorted_freq_ind.values[3], 4, 13174] = -999 for file, chan in zip(ek80_echoview_power_csv, channel_list): test_power = pd.read_csv(file, delimiter=';').iloc[:, 13:].values assert np.allclose( test_power, - echodata.beam.backscatter_r.sel(channel=chan, + echodata["Sonar/Beam_group1"].backscatter_r.sel(channel=chan, beam='1').dropna('range_sample'), rtol=0, atol=1.1e-5, @@ -196,16 +196,16 @@ def test_convert_ek80_cw_power_angle_echoview(ek80_path): # Convert from electrical angles to physical angle [deg] major = ( - echodata.beam['angle_athwartship'] + echodata["Sonar/Beam_group1"]['angle_athwartship'] * 1.40625 - / echodata.beam['angle_sensitivity_athwartship'] - - echodata.beam['angle_offset_athwartship'] + / echodata["Sonar/Beam_group1"]['angle_sensitivity_athwartship'] + - echodata["Sonar/Beam_group1"]['angle_offset_athwartship'] ) minor = ( - echodata.beam['angle_alongship'] + echodata["Sonar/Beam_group1"]['angle_alongship'] * 1.40625 - / echodata.beam['angle_sensitivity_alongship'] - - echodata.beam['angle_offset_alongship'] + / echodata["Sonar/Beam_group1"]['angle_sensitivity_alongship'] + - echodata["Sonar/Beam_group1"]['angle_offset_alongship'] ) for chan, file in zip(channel_list, ek80_echoview_angle_csv): df_angle = pd.read_csv(file) @@ -275,7 +275,7 @@ def test_convert_ek80_complex_echoview(ek80_path): ek80_echoview_bb_power_csv, header=None, skiprows=[0] ) # averaged across beams assert np.allclose( - echodata.beam.backscatter_r.sel(channel='WBT 549762-15 ES70-7C') + echodata["Sonar/Beam_group1"].backscatter_r.sel(channel='WBT 549762-15 ES70-7C') .dropna('range_sample') .mean(dim='beam'), df_bb.iloc[::2, 14:], # real rows @@ -283,7 +283,7 @@ def test_convert_ek80_complex_echoview(ek80_path): atol=8e-6, ) assert np.allclose( - echodata.beam.backscatter_i.sel(channel='WBT 549762-15 ES70-7C') + echodata["Sonar/Beam_group1"].backscatter_i.sel(channel='WBT 549762-15 ES70-7C') .dropna('range_sample') .mean(dim='beam'), df_bb.iloc[1::2, 14:], # imag rows @@ -325,8 +325,8 @@ def test_convert_ek80_cw_bb_in_single_file(ek80_path): echodata = open_raw(raw_file=ek80_raw_path_bb_cw, sonar_model='EK80') # Check there are both Sonar/Beam_group1 and /Sonar/Beam_power groups in the converted file - assert echodata.beam_power is not None - assert echodata.beam is not None + assert echodata["Sonar/Beam_group2"] + assert echodata["Sonar/Beam_group1"] # check platform nan_plat_vars = [ @@ -366,7 +366,7 @@ def test_convert_ek80_freq_subset(ek80_path): echodata = open_raw(raw_file=ek80_raw_path_freq_subset, sonar_model='EK80') # Check if converted output has only 2 frequency channels - assert echodata.beam.channel.size == 2 + assert echodata["Sonar/Beam_group1"].channel.size == 2 # check platform nan_plat_vars = [ diff --git a/echopype/tests/convert/test_convert_source_target_locs.py b/echopype/tests/convert/test_convert_source_target_locs.py index d0e96802a..c4f22f73f 100644 --- a/echopype/tests/convert/test_convert_source_target_locs.py +++ b/echopype/tests/convert/test_convert_source_target_locs.py @@ -239,34 +239,36 @@ def test_convert_time_encodings(sonar_model, raw_file, xml_path, test_path): ) ed.to_netcdf(overwrite=True) for group, details in ed.group_map.items(): - if hasattr(ed, group): - group_ds = getattr(ed, group) - if isinstance(group_ds, xr.Dataset): - for var, encoding in DEFAULT_ENCODINGS.items(): - if var in group_ds: 
- da = group_ds[var] - assert da.encoding == encoding - - # Combine encoding and attributes since this - # is what is shown when using decode_cf=False - # without dtype attribute - total_attrs = dict(**da.attrs, **da.encoding) - total_attrs.pop('dtype') - - # Read converted file back in - file_da = xr.open_dataset( - ed.converted_raw_path, - group=details['ep_group'], - decode_cf=False, - )[var] - assert file_da.dtype == encoding['dtype'] - - # Read converted file back in - decoded_da = xr.open_dataset( - ed.converted_raw_path, - group=details['ep_group'], - )[var] - assert da.equals(decoded_da) is True + group_path = details['ep_group'] + if group_path is None: + group_path = 'Top-level' + group_ds = ed[group_path] + if isinstance(group_ds, xr.Dataset): + for var, encoding in DEFAULT_ENCODINGS.items(): + if var in group_ds: + da = group_ds[var] + assert da.encoding == encoding + + # Combine encoding and attributes since this + # is what is shown when using decode_cf=False + # without dtype attribute + total_attrs = dict(**da.attrs, **da.encoding) + total_attrs.pop('dtype') + + # Read converted file back in + file_da = xr.open_dataset( + ed.converted_raw_path, + group=details['ep_group'], + decode_cf=False, + )[var] + assert file_da.dtype == encoding['dtype'] + + # Read converted file back in + decoded_da = xr.open_dataset( + ed.converted_raw_path, + group=details['ep_group'], + )[var] + assert da.equals(decoded_da) is True os.unlink(ed.converted_raw_path) diff --git a/echopype/tests/echodata/test_echodata.py b/echopype/tests/echodata/test_echodata.py index 4a4c3dcec..f22b38724 100644 --- a/echopype/tests/echodata/test_echodata.py +++ b/echopype/tests/echodata/test_echodata.py @@ -198,20 +198,20 @@ def converted_zarr(self, single_ek60_zarr): def test_constructor(self, converted_zarr): ed = EchoData.from_file(converted_raw_path=converted_zarr) expected_groups = [ - 'top', - 'environment', - 'platform', - 'provenance', - 'sonar', - 'beam', - 'vendor', + 'Top-level', + 'Environment', + 'Platform', + 'Provenance', + 'Sonar', + 'Sonar/Beam_group1', + 'Vendor_specific', ] assert ed.sonar_model == 'EK60' assert ed.converted_raw_path == converted_zarr assert ed.storage_options == {} for group in expected_groups: - assert isinstance(getattr(ed, group), xr.Dataset) + assert isinstance(ed[group], xr.Dataset) def test_repr(self, converted_zarr): zarr_path_string = str(converted_zarr.absolute()) @@ -252,27 +252,22 @@ def test_setattr(self, converted_zarr): sample_data = xr.Dataset({"x": [0, 0, 0]}) sample_data2 = xr.Dataset({"y": [0, 0, 0]}) ed = EchoData.from_file(converted_raw_path=converted_zarr) - current_ed_beam = ed.beam - current_ed_top = ed.top - ed.beam = sample_data - ed.top = sample_data2 + current_ed_beam = ed["Sonar/Beam_group1"] + current_ed_top = ed['Top-level'] + ed["Sonar/Beam_group1"] = sample_data + ed['Top-level'] = sample_data2 - assert ed.beam.equals(sample_data) is True - assert ed.beam.equals(ed['Sonar/Beam_group1']) is True - assert ed.beam.equals(current_ed_beam) is False + assert ed["Sonar/Beam_group1"].equals(sample_data) is True + assert ed["Sonar/Beam_group1"].equals(current_ed_beam) is False - assert ed.top.equals(sample_data2) is True - assert ed.top.equals(ed['Top-level']) is True - assert ed.top.equals(current_ed_top) is False + assert ed['Top-level'].equals(sample_data2) is True + assert ed['Top-level'].equals(current_ed_top) is False def test_getitem(self, converted_zarr): ed = EchoData.from_file(converted_raw_path=converted_zarr) beam = ed['Sonar/Beam_group1'] 
assert isinstance(beam, xr.Dataset) - try: - ed['MyGroup'] - except Exception as e: - assert isinstance(e, GroupNotFoundError) + assert ed['MyGroup'] is None ed._tree = None try: @@ -280,28 +275,17 @@ def test_getitem(self, converted_zarr): except Exception as e: assert isinstance(e, ValueError) - def test_getattr(self, converted_zarr): - ed = EchoData.from_file(converted_raw_path=converted_zarr) - expected_groups = { - 'top': 'Top-level', - 'environment': 'Environment', - 'platform': 'Platform', - 'nmea': 'Platform/NMEA', - 'provenance': 'Provenance', - 'sonar': 'Sonar', - 'beam': 'Sonar/Beam_group1', - 'vendor': 'Vendor_specific', - } - for group, path in expected_groups.items(): - ds = getattr(ed, group) - assert ds.equals(ed[path]) - def test_setitem(self, converted_zarr): ed = EchoData.from_file(converted_raw_path=converted_zarr) ed['Sonar/Beam_group1'] = ed['Sonar/Beam_group1'].rename({'beam': 'beam_newname'}) assert sorted(ed['Sonar/Beam_group1'].dims.keys()) == ['beam_newname', 'channel', 'ping_time', 'range_sample'] + try: + ed['SomeRandomGroup'] = 'Testing value' + except Exception as e: + assert isinstance(e, GroupNotFoundError) + def test_get_dataset(self, converted_zarr): ed = EchoData.from_file(converted_raw_path=converted_zarr) node = DataTree() @@ -352,7 +336,6 @@ def test_compute_range(compute_range_samples): ek_encode_mode, ) = compute_range_samples ed = echopype.open_raw(filepath, sonar_model, azfp_xml_path) - print(ed.platform) rng = np.random.default_rng(0) stationary_env_params = EnvParams( xr.Dataset( @@ -367,7 +350,7 @@ def test_compute_range(compute_range_samples): ), data_kind="stationary" ) - if "time3" in ed.platform and sonar_model != "AD2CP": + if "time3" in ed["Platform"] and sonar_model != "AD2CP": ed.compute_range(stationary_env_params, azfp_cal_type, ek_waveform_mode) else: try: @@ -392,7 +375,7 @@ def test_compute_range(compute_range_samples): ), data_kind="mobile" ) - if "latitude" in ed.platform and "longitude" in ed.platform and sonar_model != "AD2CP" and not np.isnan(ed.platform["time1"]).all(): + if "latitude" in ed["Platform"] and "longitude" in ed["Platform"] and sonar_model != "AD2CP" and not np.isnan(ed["Platform"]["time1"]).all(): ed.compute_range(mobile_env_params, azfp_cal_type, ek_waveform_mode) else: try: @@ -427,11 +410,11 @@ def test_nan_range_entries(range_check_files): if sonar_model == "EK80": ds_Sv = echopype.calibrate.compute_Sv(echodata, waveform_mode='BB', encode_mode='complex') range_output = echodata.compute_range(env_params=[], ek_waveform_mode='BB') - nan_locs_backscatter_r = ~echodata.beam.backscatter_r.isel(beam=0).drop("beam").isnull() + nan_locs_backscatter_r = ~echodata["Sonar/Beam_group1"].backscatter_r.isel(beam=0).drop("beam").isnull() else: ds_Sv = echopype.calibrate.compute_Sv(echodata) range_output = echodata.compute_range(env_params=[]) - nan_locs_backscatter_r = ~echodata.beam.backscatter_r.isel(beam=0).drop("beam").isnull() + nan_locs_backscatter_r = ~echodata["Sonar/Beam_group1"].backscatter_r.isel(beam=0).drop("beam").isnull() nan_locs_Sv_range = ~ds_Sv.echo_range.isnull() nan_locs_range = ~range_output.isnull() @@ -482,7 +465,7 @@ def test_update_platform( ed = echopype.open_raw(raw_file, sonar_model=sonar_model) for variable in updated: - assert np.isnan(ed.platform[variable].values).all() + assert np.isnan(ed["Platform"][variable].values).all() if ext_type == "external-trajectory": extra_platform_data_file_name = platform_data[1] @@ -507,30 +490,30 @@ def test_update_platform( ) for variable in updated: - 
assert not np.isnan(ed.platform[variable].values).all() + assert not np.isnan(ed["Platform"][variable].values).all() # times have max interval of 2s - # check times are > min(ed.beam["ping_time"]) - 2s + # check times are > min(ed["Sonar/Beam_group1"]["ping_time"]) - 2s assert ( - ed.platform["time1"] - > ed.beam["ping_time"].min() - np.timedelta64(2, "s") + ed["Platform"]["time1"] + > ed["Sonar/Beam_group1"]["ping_time"].min() - np.timedelta64(2, "s") ).all() - # check there is only 1 time < min(ed.beam["ping_time"]) + # check there is only 1 time < min(ed["Sonar/Beam_group1"]["ping_time"]) assert ( np.count_nonzero( - ed.platform["time1"] < ed.beam["ping_time"].min() + ed["Platform"]["time1"] < ed["Sonar/Beam_group1"]["ping_time"].min() ) <= 1 ) - # check times are < max(ed.beam["ping_time"]) + 2s + # check times are < max(ed["Sonar/Beam_group1"]["ping_time"]) + 2s assert ( - ed.platform["time1"] - < ed.beam["ping_time"].max() + np.timedelta64(2, "s") + ed["Platform"]["time1"] + < ed["Sonar/Beam_group1"]["ping_time"].max() + np.timedelta64(2, "s") ).all() - # check there is only 1 time > max(ed.beam["ping_time"]) + # check there is only 1 time > max(ed["Sonar/Beam_group1"]["ping_time"]) assert ( np.count_nonzero( - ed.platform["time1"] > ed.beam["ping_time"].max() + ed["Platform"]["time1"] > ed["Sonar/Beam_group1"]["ping_time"].max() ) <= 1 ) diff --git a/echopype/tests/echodata/test_echodata_combine.py b/echopype/tests/echodata/test_echodata_combine.py index 754bf4df9..229e3178e 100644 --- a/echopype/tests/echodata/test_echodata_combine.py +++ b/echopype/tests/echodata/test_echodata_combine.py @@ -107,14 +107,14 @@ def test_combine_echodata(raw_datasets): eds = [echopype.open_raw(file, sonar_model, xml_file) for file in files] combined = echopype.combine_echodata(eds, "overwrite_conflicts") # type: ignore - for group_name in combined.group_map: + for group_name, value in combined.group_map.items(): if group_name in ("top", "sonar", "provenance"): continue - combined_group: xr.Dataset = getattr(combined, group_name) + combined_group: xr.Dataset = combined[value['ep_group']] eds_groups = [ - getattr(ed, group_name) + ed[value['ep_group']] for ed in eds - if getattr(ed, group_name) is not None + if ed[value['ep_group']] is not None ] def union_attrs(datasets: List[xr.Dataset]) -> Dict[str, Any]: @@ -140,6 +140,7 @@ def union_attrs(datasets: List[xr.Dataset]) -> Dict[str, Any]: test_ds.attrs.update(union_attrs(eds_groups)) test_ds = test_ds.drop_dims( [ + # xarray inserts "concat_dim" when concatenating along multiple dimensions "concat_dim", "old_ping_time", "ping_time", @@ -174,8 +175,11 @@ def test_ping_time_reversal(ek60_reversed_ping_time_test_data): ] combined = echopype.combine_echodata(eds, "overwrite_conflicts") # type: ignore - for group_name in combined.group_map: - combined_group: xr.Dataset = getattr(combined, group_name) + for group_name, value in combined.group_map.items(): + if value['ep_group'] is None: + combined_group: xr.Dataset = combined['Top-level'] + else: + combined_group: xr.Dataset = combined[value['ep_group']] if combined_group is not None: if "ping_time" in combined_group and group_name != "provenance": @@ -199,11 +203,15 @@ def test_attr_storage(ek60_test_data): # check storage of attributes before combination in provenance group eds = [echopype.open_raw(file, "EK60") for file in ek60_test_data] combined = echopype.combine_echodata(eds, "overwrite_conflicts") # type: ignore - for group in combined.group_map: - if f"{group}_attrs" in combined.provenance: - 
group_attrs = combined.provenance[f"{group}_attrs"] + for group, value in combined.group_map.items(): + if value['ep_group'] is None: + group_path = 'Top-level' + else: + group_path = value['ep_group'] + if f"{group}_attrs" in combined["Provenance"]: + group_attrs = combined["Provenance"][f"{group}_attrs"] for i, ed in enumerate(eds): - for attr, value in getattr(ed, group).attrs.items(): + for attr, value in ed[group_path].attrs.items(): assert str( group_attrs.isel(echodata_filename=i) .sel({f"{group}_attr_key": attr}) @@ -212,10 +220,10 @@ def test_attr_storage(ek60_test_data): # check selection by echodata_filename for file in ek60_test_data: - assert Path(file).name in combined.provenance["echodata_filename"] + assert Path(file).name in combined["Provenance"]["echodata_filename"] for group in combined.group_map: - if f"{group}_attrs" in combined.provenance: - group_attrs = combined.provenance[f"{group}_attrs"] + if f"{group}_attrs" in combined["Provenance"]: + group_attrs = combined["Provenance"][f"{group}_attrs"] assert np.array_equal( group_attrs.sel( echodata_filename=Path(ek60_test_data[0]).name @@ -227,15 +235,15 @@ def test_attr_storage(ek60_test_data): def test_combine_attrs(ek60_test_data): # check parameter passed to combine_echodata that controls behavior of attribute combination eds = [echopype.open_raw(file, "EK60") for file in ek60_test_data] - eds[0].beam.attrs.update({"foo": 1}) - eds[1].beam.attrs.update({"foo": 2}) - eds[2].beam.attrs.update({"foo": 3}) + eds[0]["Sonar/Beam_group1"].attrs.update({"foo": 1}) + eds[1]["Sonar/Beam_group1"].attrs.update({"foo": 2}) + eds[2]["Sonar/Beam_group1"].attrs.update({"foo": 3}) combined = echopype.combine_echodata(eds, "override") # type: ignore - assert combined.beam.attrs["foo"] == 1 + assert combined["Sonar/Beam_group1"].attrs["foo"] == 1 combined = echopype.combine_echodata(eds, "drop") # type: ignore - assert "foo" not in combined.beam.attrs + assert "foo" not in combined["Sonar/Beam_group1"].attrs try: combined = echopype.combine_echodata(eds, "identical") # type: ignore @@ -252,17 +260,17 @@ def test_combine_attrs(ek60_test_data): raise AssertionError combined = echopype.combine_echodata(eds, "overwrite_conflicts") # type: ignore - assert combined.beam.attrs["foo"] == 3 + assert combined["Sonar/Beam_group1"].attrs["foo"] == 3 - eds[0].beam.attrs.update({"foo": 1}) - eds[1].beam.attrs.update({"foo": 1}) - eds[2].beam.attrs.update({"foo": 1}) + eds[0]["Sonar/Beam_group1"].attrs.update({"foo": 1}) + eds[1]["Sonar/Beam_group1"].attrs.update({"foo": 1}) + eds[2]["Sonar/Beam_group1"].attrs.update({"foo": 1}) combined = echopype.combine_echodata(eds, "identical") # type: ignore - assert combined.beam.attrs["foo"] == 1 + assert combined["Sonar/Beam_group1"].attrs["foo"] == 1 combined = echopype.combine_echodata(eds, "no_conflicts") # type: ignore - assert combined.beam.attrs["foo"] == 1 + assert combined["Sonar/Beam_group1"].attrs["foo"] == 1 def test_combined_encodings(ek60_test_data): @@ -270,15 +278,19 @@ def test_combined_encodings(ek60_test_data): combined = echopype.combine_echodata(eds, "overwrite_conflicts") # type: ignore group_checks = [] - for group in combined.group_map: - ds = getattr(combined, group) + for group, value in combined.group_map.items(): + if value['ep_group'] is None: + ds = combined['Top-level'] + else: + ds = combined[value['ep_group']] + if ds is not None: for k, v in ds.variables.items(): if k in DEFAULT_ENCODINGS: encoding = ds[k].encoding if encoding != DEFAULT_ENCODINGS[k]: group_checks.append( - 
f" {combined.group_map[group]['name']}::{k}" + f" {value['name']}::{k}" ) if len(group_checks) > 0: diff --git a/echopype/tests/preprocess/test_preprocess.py b/echopype/tests/preprocess/test_preprocess.py index 2a0ecc3c4..58be7cff0 100644 --- a/echopype/tests/preprocess/test_preprocess.py +++ b/echopype/tests/preprocess/test_preprocess.py @@ -495,7 +495,7 @@ def test_preprocess_mvbs(test_data_samples): ed = ep.open_raw(filepath, sonar_model, azfp_xml_path) if ed.sonar_model.lower() == 'azfp': avg_temperature = ( - ed.environment['temperature'].mean('time1').values + ed["Environment"]['temperature'].mean('time1').values ) env_params = { 'temperature': avg_temperature, diff --git a/echopype/tests/visualize/test_plot.py b/echopype/tests/visualize/test_plot.py index be6ed34db..caaa32550 100644 --- a/echopype/tests/visualize/test_plot.py +++ b/echopype/tests/visualize/test_plot.py @@ -88,7 +88,7 @@ def test_plot_single( # TODO: Need to figure out how to compare the actual rendered plots ed = echopype.open_raw(filepath, sonar_model, azfp_xml_path) plots = echopype.visualize.create_echogram( - ed, channel=ed.beam.channel[0].values + ed, channel=ed["Sonar/Beam_group1"].channel[0].values ) assert isinstance(plots, list) is True if ( @@ -111,7 +111,7 @@ def test_plot_multi_get_range( ed = echopype.open_raw(filepath, sonar_model, azfp_xml_path) if ed.sonar_model.lower() == 'azfp': avg_temperature = ( - ed.environment['temperature'].mean('time1').values + ed["Environment"]['temperature'].mean('time1').values ) env_params = { 'temperature': avg_temperature, @@ -135,7 +135,7 @@ def test_plot_multi_get_range( assert plots[0].axes.shape[-1] == 1 # Channel shape check - assert ed.beam.channel.shape[0] == len(plots) + assert ed["Sonar/Beam_group1"].channel.shape[0] == len(plots) @pytest.mark.parametrize(param_args, param_testdata) @@ -149,7 +149,7 @@ def test_plot_Sv( ed = echopype.open_raw(filepath, sonar_model, azfp_xml_path) if ed.sonar_model.lower() == 'azfp': avg_temperature = ( - ed.environment['temperature'].mean('time1').values + ed["Environment"]['temperature'].mean('time1').values ) env_params = { 'temperature': avg_temperature, @@ -176,7 +176,7 @@ def test_plot_mvbs( ed = echopype.open_raw(filepath, sonar_model, azfp_xml_path) if ed.sonar_model.lower() == 'azfp': avg_temperature = ( - ed.environment['temperature'].mean('time1').values + ed["Environment"]['temperature'].mean('time1').values ) env_params = { 'temperature': avg_temperature, @@ -237,7 +237,7 @@ def test_water_level_echodata(water_level, expect_warning): if isinstance(water_level, list): water_level = water_level[0] - echodata.platform = echodata.platform.drop_vars('water_level') + echodata["Platform"] = echodata["Platform"].drop_vars('water_level') no_input_water_level = True if isinstance(water_level, xr.DataArray): @@ -247,7 +247,7 @@ def test_water_level_echodata(water_level, expect_warning): if no_input_water_level is False: original_array = ( single_array - + echodata.platform.water_level.sel(channel='GPT 18 kHz 009072058c8d 1-1 ES18-11', + + echodata["Platform"].water_level.sel(channel='GPT 18 kHz 009072058c8d 1-1 ES18-11', time3='2017-07-19T21:13:47.984999936').values ) else: @@ -265,14 +265,14 @@ def test_water_level_echodata(water_level, expect_warning): range_in_meter=range_in_meter, water_level=water_level, data_type=EchoData, - platform_data=echodata.platform, + platform_data=echodata["Platform"], ) else: results = _add_water_level( range_in_meter=range_in_meter, water_level=water_level, data_type=EchoData, - 
platform_data=echodata.platform, + platform_data=echodata["Platform"], ) except Exception as e: assert isinstance(e, ValueError) diff --git a/echopype/visualize/api.py b/echopype/visualize/api.py index 544a32579..992aa323d 100644 --- a/echopype/visualize/api.py +++ b/echopype/visualize/api.py @@ -84,7 +84,7 @@ def create_echogram( ) yaxis = 'range_sample' variable = 'backscatter_r' - ds = data.beam + ds = data["Sonar/Beam_group1"] if 'ping_time' in ds: _check_ping_time(ds.ping_time) if get_range is True: @@ -147,7 +147,7 @@ def create_echogram( range_in_meter=range_in_meter, water_level=water_level, data_type=EchoData, - platform_data=data.platform, + platform_data=data["Platform"], ) ds = ds.assign_coords({'echo_range': range_in_meter}) ds.echo_range.attrs = range_attrs From 96ed2fd9388a631192e410f5eec05379a5ecc083 Mon Sep 17 00:00:00 2001 From: b-reyes <53541061+b-reyes@users.noreply.github.com> Date: Thu, 11 Aug 2022 08:26:08 -0700 Subject: [PATCH 17/23] change the order in _save_groups_to_file so they are the same as the EchoData structure (#779) --- echopype/convert/api.py | 60 ++++++++++++++++++++--------------------- 1 file changed, 30 insertions(+), 30 deletions(-) diff --git a/echopype/convert/api.py b/echopype/convert/api.py index 3e04e31e5..26fbb3e4f 100644 --- a/echopype/convert/api.py +++ b/echopype/convert/api.py @@ -107,15 +107,6 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): # Top-level group io.save_file(echodata["Top-level"], path=output_path, mode="w", engine=engine) - # Provenance group - io.save_file( - echodata["Provenance"], - path=output_path, - group="Provenance", - mode="a", - engine=engine, - ) - # Environment group if "time1" in echodata["Environment"]: io.save_file( @@ -136,6 +127,36 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): group="Environment", ) + # Platform group + io.save_file( + echodata["Platform"], # TODO: chunking necessary? time1 and time2 (EK80) only + path=output_path, + mode="a", + engine=engine, + group="Platform", + compression_settings=COMPRESSION_SETTINGS[engine] if compress else None, + ) + + # Platform/NMEA group: some sonar model does not produce NMEA data + if echodata["Platform/NMEA"] is not None: + io.save_file( + echodata["Platform/NMEA"], # TODO: chunking necessary? + path=output_path, + mode="a", + engine=engine, + group="Platform/NMEA", + compression_settings=COMPRESSION_SETTINGS[engine] if compress else None, + ) + + # Provenance group + io.save_file( + echodata["Provenance"], + path=output_path, + group="Provenance", + mode="a", + engine=engine, + ) + # Sonar group io.save_file( echodata["Sonar"], @@ -190,27 +211,6 @@ def _save_groups_to_file(echodata, output_path, engine, compress=True): compression_settings=COMPRESSION_SETTINGS[engine] if compress else None, ) - # Platform group - io.save_file( - echodata["Platform"], # TODO: chunking necessary? time1 and time2 (EK80) only - path=output_path, - mode="a", - engine=engine, - group="Platform", - compression_settings=COMPRESSION_SETTINGS[engine] if compress else None, - ) - - # Platform/NMEA group: some sonar model does not produce NMEA data - if echodata["Platform/NMEA"] is not None: - io.save_file( - echodata["Platform/NMEA"], # TODO: chunking necessary? 
- path=output_path, - mode="a", - engine=engine, - group="Platform/NMEA", - compression_settings=COMPRESSION_SETTINGS[engine] if compress else None, - ) - # Vendor_specific group if "ping_time" in echodata["Vendor_specific"]: io.save_file( From 0e226338744577af9f49d2171d4f3a15197d88fa Mon Sep 17 00:00:00 2001 From: b-reyes <53541061+b-reyes@users.noreply.github.com> Date: Thu, 11 Aug 2022 08:48:50 -0700 Subject: [PATCH 18/23] Remove the user option to select NMEA sentences (#778) * remove the user option to select NMEA sentences * change _parse_NMEA function name to _extract_NMEA_latlon --- echopype/convert/api.py | 6 ------ echopype/convert/set_groups_base.py | 6 ++++-- echopype/convert/set_groups_ek60.py | 2 +- echopype/convert/set_groups_ek80.py | 2 +- 4 files changed, 6 insertions(+), 10 deletions(-) diff --git a/echopype/convert/api.py b/echopype/convert/api.py index 26fbb3e4f..9400d7019 100644 --- a/echopype/convert/api.py +++ b/echopype/convert/api.py @@ -25,8 +25,6 @@ DEFAULT_CHUNK_SIZE = {"range_sample": 25000, "ping_time": 2500} -NMEA_SENTENCE_DEFAULT = ["GGA", "GLL", "RMC"] - BEAM_SUBGROUP_DEFAULT = "Beam_group1" @@ -239,9 +237,6 @@ def _set_convert_params(param_dict: Dict[str, str]) -> Dict[str, str]: The default set of parameters include: - Platform group: ``platform_name``, ``platform_type``, ``platform_code_ICES``, ``water_level`` - - Platform/NMEA: ``nmea_gps_sentence``, - for selecting specific NMEA sentences, - with default values ['GGA', 'GLL', 'RMC']. - Top-level group: ``survey_name`` Other parameters will be saved to the top level. @@ -262,7 +257,6 @@ def _set_convert_params(param_dict: Dict[str, str]) -> Dict[str, str]: out_params["platform_code_ICES"] = param_dict.get("platform_code_ICES", "") out_params["platform_type"] = param_dict.get("platform_type", "") out_params["water_level"] = param_dict.get("water_level", None) - out_params["nmea_gps_sentence"] = param_dict.get("nmea_gps_sentence", NMEA_SENTENCE_DEFAULT) # Parameters for the Top-level group out_params["survey_name"] = param_dict.get("survey_name", "") diff --git a/echopype/convert/set_groups_base.py b/echopype/convert/set_groups_base.py index 13088c877..5f0e118dd 100644 --- a/echopype/convert/set_groups_base.py +++ b/echopype/convert/set_groups_base.py @@ -11,6 +11,8 @@ DEFAULT_CHUNK_SIZE = {"range_sample": 25000, "ping_time": 2500} +NMEA_SENTENCE_DEFAULT = ["GGA", "GLL", "RMC"] + class SetGroupsBase(abc.ABC): """Base class for saving groups to netcdf or zarr from echosounder data files.""" @@ -143,10 +145,10 @@ def set_vendor(self) -> xr.Dataset: raise NotImplementedError # TODO: move this to be part of parser as it is not a "set" operation - def _parse_NMEA(self): + def _extract_NMEA_latlon(self): """Get the lat and lon values from the raw nmea data""" messages = [string[3:6] for string in self.parser_obj.nmea["nmea_string"]] - idx_loc = np.argwhere(np.isin(messages, self.ui_param["nmea_gps_sentence"])).squeeze() + idx_loc = np.argwhere(np.isin(messages, NMEA_SENTENCE_DEFAULT)).squeeze() if idx_loc.size == 1: # in case of only 1 matching message idx_loc = np.expand_dims(idx_loc, axis=0) nmea_msg = [] diff --git a/echopype/convert/set_groups_ek60.py b/echopype/convert/set_groups_ek60.py index d03bac109..5e8df4b9f 100644 --- a/echopype/convert/set_groups_ek60.py +++ b/echopype/convert/set_groups_ek60.py @@ -218,7 +218,7 @@ def set_platform(self, NMEA_only=False) -> xr.Dataset: # Collect variables # Read lat/long from NMEA datagram - time1, msg_type, lat, lon = self._parse_NMEA() + time1, msg_type, lat, lon 

From ad6dbc7f79458255ba93e68fbdd5db91bceb2780 Mon Sep 17 00:00:00 2001
From: Don Setiawan
Date: Thu, 11 Aug 2022 12:43:43 -0700
Subject: [PATCH 19/23] Add logging and optional printouts (#772)

* Add logging and optional printouts

* Fix autoimport

* Tweak propagation and warning check on plot water_level

* Make _init_logger consistent

* Missed _init_logger

* Fix missed access w/in test

* Update echopype/tests/visualize/test_plot.py

Co-authored-by: Don Setiawan

* Modify logic based on @emiliom suggestions

Co-authored-by: Emilio Mayorga
---
 echopype/__init__.py | 4 +
 echopype/calibrate/api.py | 9 +-
 echopype/calibrate/calibrate_ek.py | 7 +-
 echopype/convert/api.py | 17 +--
 echopype/convert/parse_azfp.py | 10 +-
 echopype/convert/parse_base.py | 18 +--
 echopype/convert/set_groups_ek60.py | 8 +-
 echopype/convert/set_groups_ek80.py | 5 +-
 echopype/convert/utils/ek_raw_io.py | 32 ++---
 echopype/convert/utils/ek_raw_parsers.py | 26 ++--
 echopype/echodata/combine.py | 6 +-
 echopype/echodata/echodata.py | 5 +-
 .../sensor_ep_version_mapping/v05x_to_v06x.py | 5 +-
 echopype/tests/utils/test_utils_log.py | 105 ++++++++++++++++
 echopype/tests/visualize/test_plot.py | 47 +++-----
 echopype/utils/io.py | 13 +-
 echopype/utils/log.py | 114 ++++++++++++++++++
 echopype/visualize/api.py | 12 +-
 echopype/visualize/plot.py | 6 +-
 19 files changed, 345 insertions(+), 104 deletions(-)
 create mode 100644 echopype/tests/utils/test_utils_log.py
 create mode 100644 echopype/utils/log.py

diff --git a/echopype/__init__.py b/echopype/__init__.py
index 03adaafe2..31bd5df62 100644
--- a/echopype/__init__.py
+++ b/echopype/__init__.py
@@ -6,6 +6,9 @@
 from .convert.api import open_raw
 from .echodata.api import open_converted
 from .echodata.combine import combine_echodata
+from .utils.log import verbose
+
+verbose(override=True)
 
 __all__ = [
     "open_raw",
@@ -15,4 +18,5 @@
     "consolidate",
     "preprocess",
     "utils",
+    "verbose",
 ]
diff --git a/echopype/calibrate/api.py b/echopype/calibrate/api.py
index 6c5805ad7..979091e32 100644
--- a/echopype/calibrate/api.py
+++ b/echopype/calibrate/api.py
@@ -1,8 +1,7 @@
-import warnings
-
 import xarray as xr
 
 from ..echodata import EchoData
+from ..utils.log import _init_logger
 from ..utils.prov import echopype_prov_attrs, source_files_vars
 from .calibrate_azfp import CalibrateAZFP
 from .calibrate_ek import CalibrateEK60, CalibrateEK80
@@ -16,6 +15,8 @@
     "EA640": CalibrateEK80,
 }
 
+logger = _init_logger(__name__)
+
 
 def _compute_cal(
     cal_type,
@@ -39,12 +40,12 @@ def _compute_cal(
         )
     elif echodata.sonar_model in ("EK60", "AZFP"):
         if waveform_mode is not None and waveform_mode != "CW":
-            warnings.warn(
+            logger.warning(
                 "This sonar model transmits only narrowband signals (waveform_mode='CW'). "
                 "Calibration will be in CW mode",
             )
         if encode_mode is not None and encode_mode != "power":
-            warnings.warn(
+            logger.warning(
                 "This sonar model only record data as power or power/angle samples "
                 "(encode_mode='power'). Calibration will be done on the power samples.",
             )
diff --git a/echopype/calibrate/calibrate_ek.py b/echopype/calibrate/calibrate_ek.py
index 42eacf3ba..706be6d34 100644
--- a/echopype/calibrate/calibrate_ek.py
+++ b/echopype/calibrate/calibrate_ek.py
@@ -4,8 +4,11 @@
 
 from ..echodata import EchoData
 from ..utils import uwa
+from ..utils.log import _init_logger
 from .calibrate_base import CAL_PARAMS, CalibrateBase
 
+logger = _init_logger(__name__)
+
 
 class CalibrateEK(CalibrateBase):
     def __init__(self, echodata: EchoData, env_params):
@@ -930,11 +933,11 @@ def _compute_cal(self, cal_type, waveform_mode, encode_mode) -> xr.Dataset:
 
                 if encode_mode == "power":
                     use_beam_power = True  # switch source of backscatter data
-                    print(
+                    logger.info(
                         "Only power samples are calibrated, but complex samples also exist in the raw data file!"  # noqa
                     )
                 else:
-                    print(
+                    logger.info(
                         "Only complex samples are calibrated, but power samples also exist in the raw data file!"  # noqa
                     )
             else:  # only power OR complex samples exist
diff --git a/echopype/convert/api.py b/echopype/convert/api.py
index 9400d7019..5f28e771f 100644
--- a/echopype/convert/api.py
+++ b/echopype/convert/api.py
@@ -1,5 +1,4 @@
 import warnings
-from datetime import datetime as dt
 from pathlib import Path
 from typing import TYPE_CHECKING, Dict, Optional, Tuple
 
@@ -17,6 +16,7 @@
 # fmt: on
 from ..echodata.echodata import XARRAY_ENGINE_MAP, EchoData
 from ..utils import io
+from ..utils.log import _init_logger
 
 COMPRESSION_SETTINGS = {
     "netcdf4": {"zlib": True, "complevel": 4},
@@ -27,6 +27,9 @@
 
 BEAM_SUBGROUP_DEFAULT = "Beam_group1"
 
+# Logging setup
+logger = _init_logger(__name__)
+
 
 def to_file(
     echodata: EchoData,
@@ -76,15 +79,15 @@
 
     # Sequential or parallel conversion
     if exists and not overwrite:
-        print(
-            f"{dt.now().strftime('%H:%M:%S')} {echodata.source_file} has already been converted to {engine}. "  # noqa
+        logger.info(
+            f"{echodata.source_file} has already been converted to {engine}. "  # noqa
             f"File saving not executed."
         )
     else:
         if exists:
-            print(f"{dt.now().strftime('%H:%M:%S')} overwriting {output_file}")
+            logger.info(f"overwriting {output_file}")
         else:
-            print(f"{dt.now().strftime('%H:%M:%S')} saving {output_file}")
+            logger.info(f"saving {output_file}")
         _save_groups_to_file(
             echodata,
             output_path=io.sanitize_file_path(
@@ -362,7 +365,7 @@
     EchoData object
     """
     if (sonar_model is None) and (raw_file is None):
-        print("Please specify the path to the raw data file and the sonar model.")
+        logger.warning("Please specify the path to the raw data file and the sonar model.")
         return
 
     # Check inputs
@@ -371,7 +374,7 @@
     storage_options = storage_options if storage_options is not None else {}
 
     if sonar_model is None:
-        print("Please specify the sonar model.")
+        logger.warning("Please specify the sonar model.")
 
         if xml_path is None:
             sonar_model = "EK60"

diff --git a/echopype/convert/parse_azfp.py b/echopype/convert/parse_azfp.py
index 0bffb380e..35371b84a 100644
--- a/echopype/convert/parse_azfp.py
+++ b/echopype/convert/parse_azfp.py
@@ -8,10 +8,13 @@
 import fsspec
 import numpy as np
 
+from ..utils.log import _init_logger
 from .parse_base import ParseBase
 
 FILENAME_DATETIME_AZFP = "\\w+.01A"
 
+logger = _init_logger(__name__)
+
 
 class ParseAZFP(ParseBase):
     """Class for converting data from ASL Environmental Sciences AZFP echosounder."""
@@ -273,10 +276,7 @@ def _print_status(self):
         )
         timestr = timestamp.strftime("%Y-%b-%d %H:%M:%S")
         pathstr, xml_name = os.path.split(self.xml_path)
-        print(
-            f"{dt.now().strftime('%H:%M:%S')} parsing file {filename} with {xml_name}, "
-            f"time of first ping: {timestr}"
-        )
+        logger.info(f"parsing file {filename} with {xml_name}, " f"time of first ping: {timestr}")
 
     def _split_header(self, raw, header_unpacked):
         """Splits the header information into a dictionary.
@@ -300,7 +300,7 @@
         ):  # first field should match hard-coded FILE_TYPE from manufacturer
             check_eof = raw.read(1)
             if check_eof:
-                print("Error: Unknown file type")
+                logger.error("Unknown file type")
                 return False
 
         header_byte_cnt = 0
diff --git a/echopype/convert/parse_base.py b/echopype/convert/parse_base.py
index c7dbafe46..cd2f19d3d 100644
--- a/echopype/convert/parse_base.py
+++ b/echopype/convert/parse_base.py
@@ -4,12 +4,15 @@
 import numpy as np
 
+from ..utils.log import _init_logger
 from .utils.ek_raw_io import RawSimradFile, SimradEOF
 
 FILENAME_DATETIME_EK60 = (
     "(?P<survey>.+)?-?D(?P<date>\\w{1,8})-T(?P