<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/xsl" href="assets/xml/rss.xsl" media="all"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data mining</title><link>https://www.data-mining.co.nz/</link><description>Commercial AI at the University of Waikato</description><atom:link href="https://www.data-mining.co.nz/rss.xml" rel="self" type="application/rss+xml"></atom:link><language>en</language><copyright>Contents © 2026 &lt;a href="mailto:fracpete@waikato.ac.nz"&gt;Applied Machine Learning Group, University of Waikato&lt;/a&gt; </copyright><lastBuildDate>Thu, 05 Mar 2026 22:10:01 GMT</lastBuildDate><generator>Nikola (getnikola.com)</generator><docs>http://blogs.law.harvard.edu/tech/rss</docs><item><title>Yolo26 Docker images available</title><link>https://www.data-mining.co.nz/news/2026-02-26-yolo26-docker/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;Docker images for &lt;a class="reference external" href="https://github.com/ultralytics/ultralytics/"&gt;Yolo26&lt;/a&gt; are now available.&lt;/p&gt;
&lt;p&gt;The code used by the docker images is available from here:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://github.com/waikato-datamining/pytorch/tree/master/yolo26"&gt;github.com/waikato-datamining/pytorch/tree/master/yolo26&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;The tags for the images are as follows:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;In-house registry:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;code class="docutils literal"&gt;&lt;span class="pre"&gt;harbor.cms.waikato.ac.nz/public/pytorch/pytorch-yolo26:8.4.16_cuda12.6&lt;/span&gt;&lt;/code&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;code class="docutils literal"&gt;&lt;span class="pre"&gt;harbor.cms.waikato.ac.nz/public/pytorch/pytorch-yolo26:8.4.16_cpu&lt;/span&gt;&lt;/code&gt;&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Docker hub:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;code class="docutils literal"&gt;&lt;span class="pre"&gt;waikatodatamining/pytorch-yolo26:8.4.16_cuda12.6&lt;/span&gt;&lt;/code&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;code class="docutils literal"&gt;&lt;span class="pre"&gt;waikatodatamining/pytorch-yolo26:8.4.16_cpu&lt;/span&gt;&lt;/code&gt;&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The tutorial on &lt;em&gt;object detection&lt;/em&gt; is available from here:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://www.data-mining.co.nz/applied-deep-learning/object_detection/yolo26/"&gt;www.data-mining.co.nz/applied-deep-learning/object_detection/yolo26/&lt;/a&gt;&lt;/p&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2026-02-26-yolo26-docker/</guid><pubDate>Wed, 25 Feb 2026 23:51:00 GMT</pubDate></item><item><title>PaddleX Docker Image release</title><link>https://www.data-mining.co.nz/news/2025-11-04-paddlex-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;The first Docker images for the &lt;a class="reference external" href="https://github.com/PaddlePaddle/PaddleX"&gt;PaddleX&lt;/a&gt; framework are now available:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://github.com/waikato-datamining/paddlex"&gt;https://github.com/waikato-datamining/paddlex&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;According to the project's website:&lt;/p&gt;
&lt;p&gt;&lt;em&gt;PaddleX 3.0 is a low-code development tool for AI models built on the PaddlePaddle framework. It integrates
numerous ready-to-use pre-trained models, enabling full-process development from model training to inference,
supporting a variety of mainstream hardware both domestic and international, and aiding AI developers in industrial
practice.&lt;/em&gt;&lt;/p&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-11-04-paddlex-release/</guid><pubDate>Tue, 04 Nov 2025 07:43:00 GMT</pubDate></item><item><title>spectral-data-converter release</title><link>https://www.data-mining.co.nz/news/2025-10-31-sdc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;A new release of our &lt;a class="reference external" href="https://github.com/waikato-datamining/spectral-data-converter-all"&gt;spectral-data-converter-all&lt;/a&gt; library
is now available: &lt;strong&gt;0.1.0&lt;/strong&gt;. Docker images have been deployed as well.&lt;/p&gt;
&lt;p&gt;Just like with the &lt;a class="reference external" href="https://www.data-mining.co.nz/news/2025-10-31-idc-release/"&gt;image-dataset-converter-all&lt;/a&gt;, this release benefits from
the major overhaul under the hood and the integration of the &lt;cite&gt;kasperl&lt;/cite&gt; libraries.&lt;/p&gt;
&lt;p&gt;Check out the &lt;a class="reference external" href="https://www.data-mining.co.nz/spectral-data-converter-examples/"&gt;examples&lt;/a&gt; as well.&lt;/p&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-10-31-sdc-release/</guid><pubDate>Fri, 31 Oct 2025 07:43:00 GMT</pubDate></item><item><title>audio-dataset-converter release</title><link>https://www.data-mining.co.nz/news/2025-10-31-adc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;A new release of our &lt;a class="reference external" href="https://github.com/waikato-llm/audio-dataset-converter-all"&gt;audio-dataset-converter-all&lt;/a&gt; library
is now available: &lt;strong&gt;0.1.0&lt;/strong&gt;. Docker images have been deployed as well.&lt;/p&gt;
&lt;p&gt;Just like with the &lt;a class="reference external" href="https://www.data-mining.co.nz/news/2025-10-31-idc-release/"&gt;image-dataset-converter-all&lt;/a&gt;, this release benefits from
the major overhaul under the hood and the integration of the &lt;cite&gt;kasperl&lt;/cite&gt; libraries.&lt;/p&gt;
&lt;p&gt;Check out the &lt;a class="reference external" href="https://waikato-llm.github.io/audio-dataset-converter-examples/"&gt;examples&lt;/a&gt; as well.&lt;/p&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-10-31-adc-release/</guid><pubDate>Fri, 31 Oct 2025 07:01:00 GMT</pubDate></item><item><title>image-dataset-converter release</title><link>https://www.data-mining.co.nz/news/2025-10-31-idc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;A new release of our &lt;a class="reference external" href="https://github.com/waikato-datamining/image-dataset-converter-all"&gt;image-dataset-converter-all&lt;/a&gt; library
is now available: &lt;strong&gt;0.1.0&lt;/strong&gt;. Docker images have been deployed as well.&lt;/p&gt;
&lt;p&gt;This release represents a major overhaul under the hood and lots of new functionality has been added as well.
With this release it is now possible to write more complex/integrated/reactive pipelines with the added I/O and
email plugins.&lt;/p&gt;
&lt;p&gt;It is always worth checking out the &lt;a class="reference external" href="https://www.data-mining.co.nz/image-dataset-converter-examples/"&gt;examples&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The most notably changes since 0.0.12 are:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;a new release of &lt;cite&gt;seppl&lt;/cite&gt; allows for filters that support m-to-n not just 1-to-1 conversions&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;list-to-sequence&lt;/cite&gt; stream filter that forwards list items one by one&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;common code among the image-dataset-converter, audio-dataset-converter and spectral-data-converter libraries
has been extracted and moved into separate libraries to reduce duplication: &lt;cite&gt;kasperl&lt;/cite&gt;, &lt;cite&gt;kasperl_redis&lt;/cite&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;the new &lt;cite&gt;kasperl_plots&lt;/cite&gt; library adds basic support for terminal plots (textual and graphical) and matplotlib ones&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;the filters &lt;cite&gt;tee&lt;/cite&gt;, &lt;cite&gt;trigger&lt;/cite&gt; and &lt;cite&gt;sub-process&lt;/cite&gt; support conditional execution based on meta-data evaluations now,
as well as loading their sub-pipeline from a file to break up large pipelines into smaller, logical chunks&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;block&lt;/cite&gt;, &lt;cite&gt;stop&lt;/cite&gt; filters for controlling the flow of data (via meta-data conditions)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;the &lt;cite&gt;idc-exec&lt;/cite&gt; tool now uses all trailing arguments as the pipeline to execute multiple times rather than as a single
argument to a flag; alternatively, the pipeline can be loaded from a file&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;the &lt;cite&gt;idc-convert&lt;/cite&gt; tool can load a pipeline from a file now as well&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added the &lt;cite&gt;text-file&lt;/cite&gt; and &lt;cite&gt;csv-file&lt;/cite&gt; generators that work off files to populate the variable(s)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;the readers &lt;cite&gt;from-grayscale-dp&lt;/cite&gt;, &lt;cite&gt;from-indexed-png-is&lt;/cite&gt;, &lt;cite&gt;from-blue-channel-is&lt;/cite&gt; and &lt;cite&gt;from-grayscale-is&lt;/cite&gt; now
support reading only the annotations&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;from-text-file&lt;/cite&gt; reader and &lt;cite&gt;to-text-file&lt;/cite&gt; writer&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added a number of I/O related plugins: &lt;cite&gt;list-files&lt;/cite&gt;, &lt;cite&gt;move-files&lt;/cite&gt;, &lt;cite&gt;delete-files&lt;/cite&gt;, &lt;cite&gt;copy-files&lt;/cite&gt;, &lt;cite&gt;watch-dir&lt;/cite&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added email support with &lt;cite&gt;get-email&lt;/cite&gt; reader and &lt;cite&gt;send-email&lt;/cite&gt; writer&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;console&lt;/cite&gt; writer for outputting the data on stdout that is coming through&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;count-specks&lt;/cite&gt; filter that adds counts of small objects to meta-data&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added support for caching plugins via &lt;cite&gt;IDC_CLASS_CACHE&lt;/cite&gt; environment variable&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;is-to-od&lt;/cite&gt; filter that generates object detection annotations from contours determined in image segmentation layers&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;to-metadata&lt;/cite&gt; writer that outputs the meta-data of an image&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;attach-metadata&lt;/cite&gt; filter that loads meta-data from a directory and attaches it to the data passing through&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;load-data&lt;/cite&gt; filter to turn file names into data containers&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;annotation-to-storage&lt;/cite&gt; and &lt;cite&gt;annotation-from-storage&lt;/cite&gt; filters&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;delete-storage&lt;/cite&gt; filter for removing objects from internal storage&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;annotation data is now being type-checked when setting it&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-10-31-idc-release/</guid><pubDate>Fri, 31 Oct 2025 02:45:00 GMT</pubDate></item><item><title>spectral-data-converter release</title><link>https://www.data-mining.co.nz/news/2025-07-11-sdc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;A new release of our &lt;a class="reference external" href="https://github.com/waikato-datamining/spectral-data-converter-all"&gt;spectral-data-converter-all&lt;/a&gt; library
is now available: &lt;strong&gt;0.0.2&lt;/strong&gt;. Docker images have been deployed as well.&lt;/p&gt;
&lt;p&gt;This release contains couple of major of changes:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;&lt;a class="reference external" href="https://www.data-mining.co.nz/spectral-data-converter-examples/directio/"&gt;support for direct I/O&lt;/a&gt;: most readers/writers can operate on file-like objects now as well&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a class="reference external" href="https://www.data-mining.co.nz/spectral-data-converter-examples/zip/"&gt;reading from/writing to ZIP files&lt;/a&gt;: &lt;cite&gt;from-zip&lt;/cite&gt;, &lt;cite&gt;to-zip&lt;/cite&gt;&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-07-11-sdc-release/</guid><pubDate>Thu, 10 Jul 2025 23:23:00 GMT</pubDate></item><item><title>image-dataset-converter release</title><link>https://www.data-mining.co.nz/news/2025-07-11-idc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;A new release of our &lt;a class="reference external" href="https://github.com/waikato-datamining/image-dataset-converter-all"&gt;image-dataset-converter-all&lt;/a&gt; library
is now available: &lt;strong&gt;0.0.12&lt;/strong&gt;. Docker images have been deployed as well.&lt;/p&gt;
&lt;p&gt;The most notably changes since 0.0.11 are:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;dropped numpy&amp;lt;2.0.0 restriction&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added grayscale-to-binary filter&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;fix: sort-pixels, rgb-to-grayscale filters&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;the rename filter now supports lower/upper case placeholders of name and extension as well&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;requiring seppl&amp;gt;=0.2.17 now for skippable plugin support and avoiding deprecated use of pkg_resources&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added any-to-rgb filter for turning binary/grayscale images back into RGB ones&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added label-to-metadata filter for transferring labels into meta-data&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added metadata-to-placeholder filter for transferring meta-data into placeholders&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added basic support for images with associated depth information: DepthData, DepthInformation&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added depth-to-grayscale filter for converting depth information to grayscale image&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added depth information readers from-grayscale-dp, from-numpy-dp, from-csv-dp and from-pfm-dp&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added depth information writers to-grayscale-dp, to-numpy-dp, to-csv-dp and to-pfm-dp&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added apply-ext-mask filter to applying external PNG masks to image containers (image and/or annotations)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added apply-label-mask filter for applying image segmentation label masks to their base images&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added label-present-ic and label-present-is that ensure that certain label(s) are present or otherwise discard the image&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;filter label-present was renamed to label-present-od but keeping label-present as alias for the time being&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;fix: imgseg_to_bluechannel, imgseg_to_indexedpng and imgseg_to_grayscale now handle overlapping pixels correctly, no longer adding them up and introducing additional labels&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;discard-by-name filter can use names of files in specified paths now as well&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;fixed the construction of the error messages in the pyfunc reader/filter/writer classes&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-07-11-idc-release/</guid><pubDate>Thu, 10 Jul 2025 22:23:00 GMT</pubDate></item><item><title>llm-dataset-converter release</title><link>https://www.data-mining.co.nz/news/2025-07-11-ldc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;Version &lt;strong&gt;0.2.7&lt;/strong&gt; of our &lt;a class="reference external" href="https://github.com/waikato-llm/llm-dataset-converter"&gt;llm_dataset_converter&lt;/a&gt; library has
been release. New release of ldc_doc, ldc_docx, ldc_faster_whisper, ldc_google, ldc_openai, ldc_pdf and ldc_tint
have been made available as well.&lt;/p&gt;
&lt;p&gt;The meta-library that combines all the libraries now stands at version &lt;strong&gt;0.0.6&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://github.com/waikato-llm/llm-dataset-converter-all"&gt;llm-dataset-converter-all&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;A new Docker image is available as well:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://hub.docker.com/r/waikatodatamining/llm-dataset-converter/tags"&gt;https://hub.docker.com/r/waikatodatamining/llm-dataset-converter/tags&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;This release is mostly a maintenance release, but still had some useful additions:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;set-placeholder&lt;/cite&gt; filter for dynamically setting (temporary) placeholders at runtime&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;remove-strings&lt;/cite&gt; filter that just removes sub-strings&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;strip-strings&lt;/cite&gt; filter for stripping whitespaces from start/end of strings&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-07-11-ldc-release/</guid><pubDate>Thu, 10 Jul 2025 21:20:00 GMT</pubDate></item><item><title>audio-dataset-converter release</title><link>https://www.data-mining.co.nz/news/2025-07-10-adc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;A new release of our &lt;a class="reference external" href="https://github.com/waikato-llm/audio-dataset-converter"&gt;audio-dataset-converter&lt;/a&gt;
library and it various additional dependent libraries is out.&lt;/p&gt;
&lt;p&gt;The meta-library that combines all the libraries now stands at version &lt;strong&gt;0.0.3&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://github.com/waikato-llm/audio-dataset-converter-all"&gt;audio-dataset-converter-all&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;A new Docker image is available as well:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://hub.docker.com/r/waikatodatamining/audio-dataset-converter/tags"&gt;https://hub.docker.com/r/waikatodatamining/audio-dataset-converter/tags&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Notable changes:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;improved support for placeholders via the &lt;cite&gt;set-placeholder&lt;/cite&gt; and &lt;cite&gt;metadata-to-placeholder&lt;/cite&gt; filters&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added &lt;cite&gt;from-multi&lt;/cite&gt; and &lt;cite&gt;to-multi&lt;/cite&gt; for combining multiple readers/writers&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added the &lt;cite&gt;--resume_from&lt;/cite&gt; option to readers to allow resuming the processing from a specific file&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;added the &lt;cite&gt;--split_group&lt;/cite&gt; option ti writers: a regular expression with a single group used for keeping
items in the same split, e.g., for identifying the base name of a file or the ID&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-07-10-adc-release/</guid><pubDate>Thu, 10 Jul 2025 01:07:00 GMT</pubDate></item><item><title>spectral-data-converter release</title><link>https://www.data-mining.co.nz/news/2025-06-27-sdc-release/</link><dc:creator>Applied Machine Learning Group, University of Waikato</dc:creator><description>&lt;p&gt;The first release of our &lt;a class="reference external" href="https://github.com/waikato-datamining/spectral-data-converter-all"&gt;spectral-data-converter-all&lt;/a&gt; library
is now available: &lt;strong&gt;0.0.1&lt;/strong&gt;. Docker images have been deployed as well.&lt;/p&gt;
&lt;p&gt;This library allows you to define and run processing pipelines on the command-line, e.g., for:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;converting data from one format into another (e.g., OPUS to NIR)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;clean the data (e.g., IQR)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;transform the data (e.g., SIMPLS, PLS1, standardize)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;build and apply scikit-learn models&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You can find examples for various scenarios here:&lt;/p&gt;
&lt;p&gt;&lt;a class="reference external" href="https://www.data-mining.co.nz/spectral-data-converter-examples/"&gt;data-mining.co.nz/spectral-data-converter-examples/&lt;/a&gt;&lt;/p&gt;</description><category>release</category><guid>https://www.data-mining.co.nz/news/2025-06-27-sdc-release/</guid><pubDate>Thu, 26 Jun 2025 22:23:00 GMT</pubDate></item></channel></rss>