Tag: Perl

Scraping the Dragon with Perl and Mojolicious
Every extended Labor Day weekend, 80,000 fans of pop culture descend on Atlanta for Dragon Con. It’s a sprawling choose-your-own adventure of a convention with 38 programming tracks and over 5,000 hours of events. It spans five downtown host hotels, and there is no way to see it all.

Sadly, this year’s con is almost over. Still, I thought I’d share a little script I wrote to help me make sense of it all.

The official mobile app is fine for searching and bookmarking events, speakers, and exhibitors. Nonetheless, it’s not suitable for scanning the whole landscape at once. I wanted a single, scrollable view of every event, before I even packed my cosplay.

Even in the app’s tablet version, the Dragon Con Events area is a scroll-fest.

The web version of the app gave me exactly what I needed: predictable per-day URLs and semantically marked-up HTML. That meant I can skip the API hunt, skip the manual scrolling, and go straight to scraping.

Inspecting the HTML reveals per-day event URLs and per-event <div> blocks.

From Chaos to Clarity in 40 lines

We’re about to turn a messy, multi-day, multi-hotel schedule into one clean, scroll-once list. This is the forty-five-line Perl map that gets us there, aided by the Mojolicious web toolkit.

Laying the Groundwork: Tools for the Job
```
#!/usr/bin/env perl

use v5.40;

use Carp;
use English;
use Mojo::UserAgent;
use Mojo::URL;
use Mojo::DOM;
use Mojo::Collection q(c);
use Time::Piece;
use HTML::HTML5::Entities;
use Memoize;

binmode STDOUT, ':encoding(UTF-8)'
  or croak "Couldn't encode STDOUT: $OS_ERROR";

my $ua   = Mojo::UserAgent->new();
my $site = Mojo::URL->new('https://app.core-apps.com');
my $path = '/dragoncon25/events/view_by_day';
```
What’s happening: Load the modules that will do the heavy lifting–HTTP fetches, DOM parsing, date handling, Unicode cleanup. Lock STDOUT to UTF‑8 so characters like curly quotes and em-dashes don’t break the output. Point the script at the base schedule URL.

Remembering the Days Without Re-Parsing
```
my $date_from_dom = memoize( sub ($dom) {
  return content_at( $dom, 'div.section_header[class~="alt"]' );
} );
```
What’s happening: Create a memoized helper that plucks the date from a day’s HTML and caches it. That way, if we need it again, we skip the DOM re-parse and keep the pipeline fast.

content_at is a helper function I define later.

Starting Where the App Starts
```
my $today_dom = Mojo::DOM->new( $ua->get("$site$path")->result->text );
```
What’s happening: Fetch the “today” view–the same default the app shows. This is so we have a known starting point for building the full timeline.

Collecting the Whole Timeline
```
my $day_doms = c(
  $today_dom,
  $today_dom->find(qq(div.filter_box-days > a[href^="$path?day="]))
    ->map( \&dom_from_anchor )
    ->to_array->@*,
)->sort( sub { day_epoch($a) <=> day_epoch($b) } );
```
What’s happening: Grab every day link from the filter bar, fetch each day’s HTML, and sort them chronologically. Now we’ve got the entire con’s schedule in memory, ready to process.

dom_from_anchor and day_epoch are two more helper functions explained further down.

Turning HTML into a Human-Readable Schedule
```
$day_doms->each( sub {    # process each day's events
  my $date = $date_from_dom->($_);

  $_->find('a.bookmark[data-type="events"] + a.object_link')
    ->each( sub {         # output start time + title

      my $time    = content_at( $_, 'div.line[class~="two"]' );
      my $title   = content_at( $_, 'div.line[class~="one"]' );
      my ($start) = split /\s*\p{Dash_Punctuation}/, $time;

      say "$date $start: ", decode_entities($title);
    } );
} );
```
What’s happening: For each day, find every event link and pull out the start time and title. Split the time cleanly on any dash and decode HTML entities so the output reads like a real schedule.

The Little Routines That Make It All Work
```
sub dom_from_anchor ($dom) {    # fetch DOM for a day link
  return Mojo::DOM->new(
    $ua->get( Mojo::URL->new( $dom->attr('href') )->to_abs($site) )
      ->result->text );
}

sub day_epoch ($dom) {    # parse date into epoch
  return Time::Piece->strptime( $date_from_dom->($dom), '%A, %b %e' )
    ->epoch;
}

# extract and trim text from selector
sub content_at ( $dom, @args ) { return trim $dom->at(@args)->content }
```
What’s happening:
1. dom_from_anchor: fetch and parses a linked days’ HTML.
2. day_epoch: turn a date string into a sort-able epoch.
3. content_at: extract and trim text from a DOM fragment, given a CSS selector.
These helpers keep the main flow readable and re-usable.
The Schedule, Unlocked

Run the script and you get a clean, UTF-8-safe list of every event, in chronological order, across all days. No swiping around, no tapping, no “what did I miss?” anxiety. (Ha, who am I kidding? There’s too much going on at Dragon Con to not end up missing something.)

An example run of the script in my terminal. Each line is “Day, Date Time: Event Title”, sorted chronologically across the whole con.

And here’s just a small slice of the 2,500+ lines it produces:

Sunday, Aug 31 11:30 AM: Unmasking Sherlock: Beyond the Many Faces
Sunday, Aug 31 11:30 AM: Weaponization of the FCC and Other Agencies to Chill Speech
Sunday, Aug 31 11:30 AM: Where Physics Gets Weird
. . .
Sunday, Aug 31 11:50 AM: Photo Session: Amelia Tyler
Sunday, Aug 31 11:50 AM: Photo Session: Cissy Jones
Sunday, Aug 31 11:50 AM: Photo Session: Emma Gregory
. . .
Sunday, Aug 31 12:00 PM: Dragon Con Mashups
Sunday, Aug 31 12:00 PM: James J. Butcher and R.R. Virdi signing at The Missing Volume booth# 1300
Sunday, Aug 31 12:00 PM: JoeDan Worley and Eric Dontigney signing at the Shadow Alley Press Booth# 2
. . .
Sunday, Aug 31 12:00 PM: Photo Session: Robert Duncan McNeill
Sunday, Aug 31 12:00 PM: Photo Session: Robert Picardo
Sunday, Aug 31 12:00 PM: Photo Session: Tamara Taylor
Key Techniques

Here’s the fun part–the techniques that make this tidy, scroll-once list possible.

CSS selectors for precision

I used a.bookmark[data-type="events" + a.object_link] to grab only the event title links, and div.line[class~="two" /div.line[class~="one"] for time and title, respectively. This avoids scraping unrelated elements.

Memoization for efficiency

memoize caches the date string for each day’s DOM so I didn’t end up re-parsing the HTML fragment multiple times.

Unicode-safe splitting

\p{Dash_Punctuation} matches any dash type (em, en, hyphen-minus, etc.), so I could split times reliably without worrying about which dash the site uses.

Functional chaining

Mojo::Collection’s map, sort, and each methods let me express the scrape→transform→output pipeline in a linear, readable way.

Entity decoding at output

HTML::HTML5::Entities’ decode_entities is applied right before printing, so HTML entities like & or " are human-readable in the final output.
A Pattern You Can Take Anywhere

The same approach that tamed Dragon Con’s chaos works anywhere you’ve got:
- Predictable URLs–so you can iterate without guesswork
- Consistent HTML structure–so your selectors stay stable
- A need to see everything at once–so you can make decisions without paging or filtering
From fan conventions to conference schedules, from local sports fixtures to film festival line‑ups–the same pattern applies. Sometimes the right tool isn’t a sprawling framework or heavyweight API client. It’s a forty‑odd‑line Perl script that does one thing with ruthless clarity.

Because once you’ve tamed a schedule like this, the only lines you’ll stand in are the ones that feel like part of the show.
August 31, 2025

Even lighter Perl modulinos with Util::H2O::More

A few weeks ago, I wrote about how to use the modulino pattern in Perl to create unit-testable command line tools. Fellow Houston Perl Monger Brett Estrade pointed me to a different approach on the Perl Applications & Algorithms Discord. This approach trims boilerplate while keeping scripts testable.

Brett’s Utils::H2O::More module amends the lightweight class builder Utils::H2O. It adds many extra methods, including command line argument processing via the Perl-packaged Getopt::Long module. It also promises to build its accessors with less ceremony and code than Moo.

So let’s dive in!

A simple script

A script can use Util::H2O::More’s Getopt2h2o function to process command line options. It returns an object with accessors for each parameter.

Here’s a simple example, modeled after my earlier modulino exercise:

#!/usr/bin/env perl

use v5.38;
use Util::H2O::More v0.4.2 qw(Getopt2h2o);

# name is a string, water is an array of strings
my $o = Getopt2h2o \@ARGV, {}, qw(
    name=s
    water=s@
);
die "Missing --name\n" unless $o->name;

printf "Good %s, %s!\n", time_of_day(), $o->name;

if ( defined $o->water and $o->water->@* ) {
    say 'What kind of water would you like?';
    say "- $_" for $o->water->@*;
}

sub time_of_day {
    my %hours = (
         5 => 'morning',
        12 => 'afternoon',
        17 => 'evening',
        21 => 'night',
    );

    for ( sort { $b <=> $a } keys %hours ) {
        return $hours{$_} if (localtime)[2] >= $_;
    }
    return 'night';
}

This is short and readable. I like how Getopt::Long’s quirky parameter parsing syntax is repurposed to create accessors, even for multi-valued options.

You can call this script like so:

./h2options.pl --name Aquarius --water hot --water cold

We had to check that --name was set, as Util::H2O::More doesn’t support using an = modifier to specify required options. This is something Getopt::Long allows.

And as Util::H2O’s documentation suggests: “You should probably switch to something like Moo instead [for advanced features].”

But enough about limitations–what if you wanted to use this as a Perl module for testing?

Testing the waters

One of the strengths of a modulino is the ability to unit test its logic without invoking it from the shell. A typical test script looks like this:

#!/usr/bin/env perl

use v5.38;
use Test2::V0;
use modulinh2o;

plan(6);

can_ok(
    'modulinh2o',
    [ 'time_of_day', 'name', 'water' ],
    'class has methods',
);
my $water = modulinh2o->new( name => 'Aquarius' );
isa_ok( $water, ['modulinh2o'],
        'object is expected class' );
can_ok(
    $water,
    [ 'time_of_day', 'name', 'water' ],
    'object has methods',
);

is( $water->name,        'Aquarius', 'name set' );
is( $water->name('Bob'), 'Bob',      'name change' );
is( $water->time_of_day,
    in_set( qw(
        morning
        afternoon
        evening
        night
    ) ),
    'time of day function',
);

It isn’t difficult to adapt a modulino from our earlier simple script:

#!/usr/bin/env perl

use v5.38;

package modulinh2o;

use Getopt::Long qw();
use Util::H2O::More v0.4.2 qw(h2o opt2h2o);

my @opt_spec = qw(
    name=s
    water=s@
);
my $o = h2o -classify => __PACKAGE__, {},
        opt2h2o(@opt_spec);

sub time_of_day {
    my %hours = (
         5 => 'morning',
        12 => 'afternoon',
        17 => 'evening',
        21 => 'night',
    );

    for ( sort { $b <=> $a } keys %hours ) {
        return $hours{$_} if (localtime)[2] >= $_;
    }
    return 'night';
}

# constructor that parses arguments w/ basic validation
sub new_with_options ($class) {
    Getopt::Long::GetOptionsFromArray(
      \@ARGV, $o, @opt_spec
    ) or die "bad options\n";
    die "Missing --name\n" unless $o->name;
    return $o;
}

sub run ($self) {
    printf "Good %s, %s!\n",
           time_of_day(), $self->name;

    if ( defined $self->water and $self->water->@* ) {
        say 'What kind of water would you like?';
        say "- $_" for $self->water->@*;
    }
    return;
}

package main;

main() unless caller;

sub main { modulinh2o->new_with_options->run() }

And run:

./modulinh2o.pm --name Aquarius \
  --water sparkling --water still

This is much shorter than the Moo-based modulino from three weeks ago, but it also doesn’t do as much. There’s no support for using comma separators to pass multiple values to a single argument. Worse, there’s no automatic help text if one passes the wrong options.

Both are fixable, as we’ll see in a moment. Still, you end up having to write the POD yourself, printed out with various invocations of Pod::Usage’s pod2usage() function.

An ounce of script is worth a gallon of documentation

Here’s a full example that adds both --help and --man command line options, as typically provided by traditional Getopt::Long-based scripts:

#!/usr/bin/env perl

use v5.38;

package modulinh2o2;

use Getopt::Long qw();
use Util::H2O::More v0.4.2 qw(h2o opt2h2o);
use Pod::Usage;

my @opt_spec = qw(
    name=s
    water=s@

    help
    man
);
my $o = h2o -classify => __PACKAGE__, {},
        opt2h2o(@opt_spec);

sub time_of_day {
    my %hours = (
         5 => 'morning',
        12 => 'afternoon',
        17 => 'evening',
        21 => 'night',
    );

    for ( sort { $b <=> $a } keys %hours ) {
        return $hours{$_} if (localtime)[2] >= $_;
    }
    return 'night';
}

# different parameter mixes for pod2usage()
my %pod2usage_opt = (
    cmdline => {
        -exitval  => 2,
        -verbose  => 99,
        -sections => 'USAGE/Command line',
        -message  => 'Use --help to list options',
    },
    opts => {
        -exitval  => 0,
        -verbose  => 99,
        -sections => ['USAGE/Command line', 'OPTIONS'],
    },
    man => {
        -exitval => 0,
        -verbose => 2,
    },
);

sub new_with_options ($class) {
    Getopt::Long::GetOptionsFromArray(
      \@ARGV, $o, @opt_spec
    ) or pod2usage( %pod2usage_opt{cmdline} );

    pod2usage( $pod2usage_opt{opts} ) if $o->help;
    pod2usage( $pod2usage_opt{man} )  if $o->man;
    pod2usage( %pod2usage_opt{cmdline},
      -message => 'Missing --name',
    ) unless $o->name;

    # default values for the water parameter
    $o->water(
          ( defined $o->water and $o->water->@* )
        ? [ split /,/, join q{,}, $o->water->@* ]
        : [ qw(
            still
            sparkling
            tap
        ) ] );

    return $o;
}

sub run ($self) {
    printf "Good %s, %s!\n",
           time_of_day(), $self->name;

    say 'What kind of water would you like?';
    say "- $_" for $self->water->@*;

    return;
}

package main;

main() unless caller;

sub main { modulinh2o2->new_with_options->run() }

# the rest below is documentation

__END__

=head1 NAME

modulinh2o2 - demo of a modulino using Util::H2O::More

=head1 USAGE

=head2 Command line

    modulinh2o2.pm [options]

=head2 Perl

    use modulinh2o2;
    my $water = modulinh2o2->new(
        name  => 'Aquarius',
        water => [ qw(
            sparkling
            still
            tap
        ) ],
    );
    $water->run;

=head1 OPTIONS

=over

=item B<--name>

Your name here! (required)

=item B<--water>

Type of water to serve. May be specified multiple times, either by repeating the option or separated by commas.

Takes an arrayref when used as a method or construction parameter.

Default values:

=over

=item still

=item sparkling

=item tap

=back

=item B<--help>

Displays a brief help message.

=item B<--man>

Display full documentation as a manual page.

=back

=head1 DESCRIPTION

A sample L<Util::H2O::More> modulino that prints your name, what part of the day it is, and a menu of water choices.

=head1 METHODS

=head2 new

Constructor that takes the above L</OPTIONS> but without the preceding C<-->.

=head2 new_with_options

Alternate constructor that receives its parameters from C<@ARGV>.

=head2 run

Prints out a greeting along with a selection of refreshing drinks.

=head2 ACCESSOR OPTIONS

All of the L</OPTIONS> above are also available as method accessors but without the preceding C<-->.

=head1 FUNCTIONS

=head2 time_of_day

Returns the period of the day based on local time. Possible values are:

=over

=item morning

=item afternoon

=item evening

=item night

=back

=head1 AUTHOR

Mark Gardner <[email protected]>

=head1 LICENSE AND COPYRIGHT

This software is copyright (c) 2025 by Mark Gardner.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.

Now we can run either:

./modulinh2o2.pm --help

to get a quick help message, or:

./modulinh2o2.pm --man

to show the full manual page.

Yes, over half of the line count is documentation. Even granting this the code amount is now comparable to the Moo-based version. And you don’t get the advantage of MooX::Options’ declarative syntax.

To summarize, here’s a table charting the evolution of our modulinh2o:

Stage	Pros	Cons	Complexity
Simple script: `Getopt2h2o` only	* Minimal code footprint * Instant accessors from CLI arguments * Multi-valued options via @ syntax	* No required-argument enforcement * No auto-help * Manual validation	Low: about 20 lines, one file
Basic modulino: `h2o` + `opt2h2o`	* Testable as a module * Reusable `new_with_options` constructor * Keeps brevity vs. Moo	* Still no comma-separated multi-values * No auto-help on bad arguments	Medium: adds package structure, test harness
Full modulino w/help & man: Pod::Usage integrated	* Support for `--help` and `--man` * Comma-separated multi-values handled * Defaults for missing `--water` * Clearer on-boarding for new users	* More boilerplate * Over half the file is POD * Still less declarative than MooX::Options	High: more moving parts, but user-friendly

Util::H2O::More modulino evolution and feature trade-offs

Seen together, these stages trace a course from a bare-bones script to a well-provisioned modulino. Each step adds features at the cost of a little more complexity.

In the end, Util::H2O::More delivers a lean, testable modulino with far less boilerplate than Moo–but also fewer built‑in niceties. If you value speed from idea to working script and can live without declarative option handling, it’s a compelling choice. For more structured needs, MooX::Options still has the edge. Either way, the path from script to production-ready modulino is shorter than you think.

The best tool is the one that flows from idea to “it works” in a single pour.

August 24, 2025

Kaiju Boss Battle: A Dist::Zilla Journey from Chaos to Co-Op
Last week I wrote about developing a Perl module enabling me to output log entries to macOS’ unified logging system. This weekend’s adventure involved porting that module’s manual processes. These processes included dependency management, documentation syncing, version bumping, and release. The goal was to make everything more automated and repeatable.

And maybe even… monstrous.

I was already familiar with the Dist::Zilla (DZil to its friends) suite of tools and plugins. I also knew that some Perl developers view it as a huge barrier to entry. This perception affects their willingness to contribute to others’ projects.

So even though Log-Any-Adapter-MacOS-OSLog was a small module of interest to a limited coding audience (macOS Perl users), I thought it wise to have both a main branch and separate build/main branch for those programmers who wanted to work as though things had barely changed:
- Entire source code with full POD-formatted documentation;
- A Makefile.PL script to generate a portable building, testing, and installing Makefile;
- Plus, every Perl distribution should supply the typical README, MANIFEST, LICENSE, and CONTRIBUTING documentation. This is essential if it’s meant for public consumption.
A small distribution would also give me a model I can scale up for larger projects. At the very least, it was another learning opportunity for me.

Boy, was it.

Why Dist::Zilla?

Because I know it can automate away the boilerplate code and repetition of information that’s unfortunately necessary in a modern Perl module distribution:
- the version numbers in every module and script
- the README that often includes the same introductory text as the main module’s documentation
- the naming, order, and content of POD sections (via DZil’s sister suite, Pod::Weaver), some of which repeat distribution metadata like author, version, support, copyright, license, and so on
If you need further detail, Dan Book’s Dist::Zilla::Starter is, as its name suggests. an excellent and remarkably thorough guide to the how and why of Dist::Zilla. It even covers the basic structure of CPAN distributions and the history of Perl module authoring tools.

Why not something else?

Because just as in the story of Goldilocks and the three bears, everything else I looked at seemed wanting in some way:
- Module::Build/Module::Build::Tiny: Although minimal, pure-Perl, and easy to install, Module::Build proper still requires shipping extra boilerplate and duplicate metadata. ::Tiny shaves that down further but drops whole swaths of functionality. As an example, you can’t specify at setup time that users need a specific operating system. It’s a deal-breaker for a module that requires macOS 10.12 Sierra or newer.
- Minilla: Opinionated convention over configuration, but I don’t share its opinions. Overriding directory layout, test structure, and README/license handling would mean fighting Minilla’s defaults, DZil-config style, without DZil’s plugin ecosystem.
- ShipIt: Simple, one-file configuration. But too simple: it’s mostly release automation and not authoring automation. Everything I wanted to tool away is still manual.
- No or minimal toolchain: That was how I started this module, with ExtUtils::MakeMaker. Totally manual, with every contributor, including yours truly, needing to remember all the moving parts by hand.
Off to see the lizard!

My DZil dist.ini configuration file started off simply enough:
```
name    = Log-Any-Adapter-MacOS-OSLog
author  = Mark Gardner <[email protected]>
license = Perl_5
copyright_holder = Mark Gardner
version = 0.0.5 ; bump as appropriate

[MetaResources]
repository.type = git
bugtracker.web  = https://codeberg.org/mjgardner/perl-Log-Any-Adapter-MacOS-OSLog/issues

[Repository]
web = https://codeberg.org/mjgardner/perl-Log-Any-Adapter-MacOS-OSLog
```
That [MetaResources] is the first evidence of using plugins. It provides a nice tidy way to specify the metadata. Automated tools can use this information to index, examine, package, or install Perl distributions.

So far so good. But then the monster attacked.
```
[@Filter]
-bundle = @Basic
-remove = GatherDir
-remove = MakeMaker
-remove = Readme
; License plugin still active here

[GatherDir]
include_dotfiles = 1
exclude_filename = .gitignore
exclude_filename = .build

[ReadmeAnyFromPod]
type = markdown
filename = README.md
location = build

[CopyFilesFromBuild]
copy = README.md

; various MakeMaker::*, Git::*, Pod::Weaver, etc. plugins
```
I was getting errors from the dzil --build command as it attempted to add the same file multiple times. These were generated files committed to the main version control branch and a build/main branch for non-DZil contributors.

This brought me to a dead stop. I went down a rabbit-hole examining built files from rsync(1) with [Run::AfterBuild]., comparing them to the branch I had set up.

Plus my .perlcriticrc file was disappearing from the build artifacts despite that instruction to [GatherDir] to include dotfiles.

“Let them fight.”

LICENSE to kill (files)

Eventually I traced the LICENSE file duplication to dueling DZil plugins. The aforementioned [GatherDir] was dutifully copying the file from my working root even as the [License] plugin was generating it. I didn’t want to lose the latter source of whatever license I happened to be using to distribute this code.

And .perlcriticrc? It turns out the [PruneCruft] plugin doesn’t listen to its friend [GatherDir] and was hoovering it away.

In the end, I had to expand my manually-configured plugin roster a little to keep them from fighting over what files came from where:
```
[@Filter]
-bundle = @Basic
-remove = GatherDir
-remove = PruneCruft
-remove = MakeMaker
-remove = Readme

[PruneCruft]
except = .perlcriticrc

[GatherDir]
include_dotfiles = 1
exclude_filename = .gitignore
exclude_filename = .build
exclude_filename = LICENSE

[ReadmeAnyFromPod]
type            = markdown
filename        = README.md

[CopyFilesFromBuild]
copy = README.md
copy = LICENSE
```
With these tweaks to Log-Any-Adapter-MacOS-OSLog’s dist.ini:
- [License] generates its file,
- [CopyFilesFromBuild] copies it along with the README into the root for my commit,
- And [GatherDir] doesn’t stomp on it like so many Tokyo city blocks.
The build/main branch for non-DZil contributors would contain exactly what they and I expected. It would be a match of the DZil working copy plus artifacts, git cloneable with no surprises.

Lessons learned, and what’s next?

I wanted to serve two styles of development and had signed myself up for a complicated authoring process. I now have more knowledge of which plugin runs in each phase of the build. This allows me to decide between generated and version-controlled files.

CPAN has a rich variety of personal Dist::Zilla::PluginBundle::s factoring individual authors’ preferences into a single place. Those authors don’t have to copy-and-paste DZil configurations around. It’s past time for me to mint one of my own bundles for more consistent and well-understood Perl distributions. Then I can start spinning up new projects without revisiting the same pain.

This weekend has brought on a mental shift. I moved from fighting DZil’s defaults to making it work my way. I also found satisfaction in a predictable, minimal-effort Perl release pipeline.

The monster was just a guy in a rubber suit all along.
August 17, 2025
Logging from Perl to macOS’ unified log with FFI and Log::Any
Part 1: The elephant in the room

A few weeks ago, I started hosting my own Mastodon instance on a Mac mini in my home office. I wanted to join the social Fediverse on my own terms–but it didn’t take long to notice ballooning disk usage. Cached media from other users’ posts was piling up fast.

That got me thinking: how do I track this growth before it gets out of hand?

Logging seemed like the obvious answer. On Unix and Linux systems, it’s straightforward enough. But on macOS, finding a native, maintainable solution takes more digging.

Part 2: Feeding the Apple

macOS is Unix-based, so you’d expect logging to be simple. You can install logrotate via Homebrew, then schedule it with cron(8). It works–but it adds layers of configuration files, permissions, and guesswork. I wanted something native. Something that felt like it belonged on a Mac.

Turns out, macOS offers two built-in options. One is newsyslog, a BSD-style tool that rotates logs based on size or time. It’s reliable, but it requires privileged root-owned configuration files and feels like a holdover from older Unix systems.

The other is Apple’s unified logging system–a modern API used across macOS, iOS, and even watchOS. It’s structured, searchable, and already baked into the platform. That’s the one I decided to explore.

Howard Oakley’s explainer on the Unified Log helped me understand Apple’s system for consolidating logs. It showed how they are stored in a compressed binary format, complete with structured metadata and privacy controls. With that foundation, I turned to Apple’s OSLog Framework documentation. It showed how to tag entries and filter them with predicates. macOS handles the rest.

It’s elegant–but you need to use the API to write logs. Yes, reading and filtering can be done on the command line or in the Console app. But Apple seems to expect logging to be the sole province of Swift and Objective‑C developers. I’d rather not have to learn a new programming language just to write logs.

UPDATE: Howard Oakley’s blowhole utility provides a simple way to write to the unified log from the command line, but all messages come from the “co.eclecticlight.blowhole” subsystem with a “general” category. We can do better.

Part 3: A platypus in the key of C

I do know Perl. I also know just enough C to be dangerous. And I briefly considered learning Swift or Objective‑C. Nevertheless, I wondered about bridging Perl to Apple’s unified logging system without switching languages.

macOS exposes a C API in <os/log.h>:
```
#include <os/log.h>

void
os_log(os_log_t log, const char *format, ...);

void
os_log_info(os_log_t log, const char *format, ...);

void
os_log_debug(os_log_t log, const char *format, ...);

void
os_log_error(os_log_t log, const char *format, ...);

void
os_log_fault(os_log_t log, const char *format, ...);
```
Perl’s CPAN has a module called FFI::Platypus that would let me call foreign functions in C and other languages. It looked promising.

But there’s a catch: these logging functions are variadic macros, not plain functions. That makes them inaccessible via FFI. Worse, they expand into private API calls–unstable across OS updates and risky to rely upon.

So I wrote a small C wrapper to convert each macro into a proper function. This makes them FFI-safe and lets me control visibility (public logging vs. private, redacted logging) using Apple’s format specifiers:
```
#include <os/log.h>

#define DEFINE_OSLOG_WRAPPERS(level_macro, suffix)    \
    void os_log_##suffix##_public(os_log_t log,       \
                                  const char *msg) {  \
        level_macro(log, "%{public}s", msg);          \
    }                                                 \
    void os_log_##suffix##_private(os_log_t log,      \
                                   const char *msg) { \
        level_macro(log, "%{private}s", msg);         \
    }

// Generate wrappers for each log level
DEFINE_OSLOG_WRAPPERS(os_log, default)
DEFINE_OSLOG_WRAPPERS(os_log_info, info)
DEFINE_OSLOG_WRAPPERS(os_log_debug, debug)
DEFINE_OSLOG_WRAPPERS(os_log_error, error)
DEFINE_OSLOG_WRAPPERS(os_log_fault, fault)
```
This macro generates two functions per log level–one public, one private–giving downstream Perl code a choice. It’s verbose, but it’s safe, auditable, and future-proof.

Part 4: Plugging into Log::Any

With the wrapper library in place, I began mapping Apple’s log levels to something Perl can use. I chose Log::Any from CPAN because it’s lightweight, widely supported, and its adapters don’t lock you into a specific back-end. The same code that logs to the screen can also log to a file, or in our case, Apple’s system.

Admittedly, at this point I’m no longer writing a simple logging script for my Mastodon instance. Instead, it’s a full-fledged logging module. Oh well.

Some Log::Any levels share the same underlying Apple call– OSLog doesn’t distinguish between notice and info or trace and debug. That’s a little different from how Unix syslog does things, but that’s fine. The goal here is compatibility, not perfect fidelity.

Building a simple dispatch table to route log messages based on level, I then used FFI::Platypus to bind each wrapper function:
```
use FFI::Platypus 2.00;

my %OS_LOG_MAP = (
    trace     => 'os_log_debug',
    debug     => 'os_log_debug',
    info      => 'os_log_info',
    notice    => 'os_log_info',
    warning   => 'os_log_fault',
    error     => 'os_log_error',
    critical  => 'os_log_default',
    alert     => 'os_log_default',
    emergency => 'os_log_default',
);

my $ffi = FFI::Platypus->new(
    api => 2,
    lib => [ './liboslogwrapper.dylib' ],
);

$ffi->attach(
    [ os_log_create => '_os_log_create' ],
    [ 'string', 'string' ],
    'opaque',
);

# attach each wrapper function
my %UNIQUE_OS_LOG = map { $_ => 1 } values %OS_LOG_MAP;
foreach my $function ( keys %UNIQUE_OS_LOG ) {
    for my $variant (qw(public private)) {
        my $name = "${function}_$variant";
        $ffi->attach(
            [ $name => "_$name" ],
            [ 'opaque', 'string' ],
            'void',
        );
    }
}
```
This setup gives me a clean way to log from Perl using Apple’s native system. I can achieve this without touching Swift, Objective‑C, or external tools. Each log level maps to a C wrapper, and the FFI layer handles the rest.

Now I just need an init function to create the os_log_t object and a set of methods for logging and detecting whether a given log level is enabled:
```
use strict;
use Carp;
use base qw(Log::Any::Adapter::Base);
use Log::Any::Adapter::Util qw(
  detection_methods
  numeric_level
);

sub init {
    my $self = shift;
    $self->{private} ||= 0;
    croak 'subsystem is required'
      unless defined $self->{subsystem};

    $self->{_os_log} = _os_log_create(
      @{$self}{qw(subsystem category)},
    );

    return;
}

foreach my $log_level ( keys %OS_LOG_MAP ) {
    no strict 'refs';
    *{$log_level} = sub {
        my ( $self, $message ) = @_;

        &{  "_$OS_LOG_MAP{$log_level}_"
                . ( $self->{private}
                    ? 'private'
                    : 'public'
                ) }( $self->{_os_log}, $message );
    };
}

foreach my $method ( detection_methods() ) {
    my $method_level = numeric_level(substr $method 3);
    no strict 'refs';
    *{$method} = sub {
        !!( $method_level <= (
          $_[0]->{log_level} // numeric_level('info')
        ) );
    };
}
```
What’s that “subsystem” bit up there? That’s the term macOS uses for identifying processes in logs. They’re usually formatted in reverse DNS notation (e.g., “com.example.perl”). Once again, Howard Oakley has a great explainer on the topic.

Also, there’s some metaprogramming going on there:
- The first foreach loop creates functions called trace, debug, and info. These functions call the corresponding FFI::Platypus-created functions. It uses the private variants if the private attribute for the log adapter was set.
- The second foreach loop creates creates functions called is_trace, is_debug, is_info, etc., that return true if the adapter is catching that level of log message.
Part 5: At long last, logging… mostly

Once this is packaged in a Perl module, how do you use it? At least that part isn’t too hard:
```
use Log::Any '$log', default_adapter => [
  'MacOS::OSLog', subsystem => 'com.phoenixtrap.perl',
];
use English;
use Carp qw(longmess);

$log->info('Hello from Perl!');
$log->infof('You are using Perl %s', $PERL_VERSION);

$log->trace( longmess('tracing!') );
$log->debug(     'debugging!'     );
$log->info(      'informing!'     );
$log->notice(    'noticing!'      );
$log->warning(   'warning!'       );
$log->error(     'erring!'        );
$log->critical(  'critiquing!'    );
$log->alert(     'alerting!'      );
$log->emergency( 'emerging!'      );
```
And then you can run this command line to stream log messages from the subsytem used above:
```
% log stream --level debug \
  --predicate 'subsystem == "com.phoenixtrap.perl"
```
What happened to the trace and debug log messages that were supposed to call os_log_debug(3)? According to macOS’ log(1) manual page, you have to explicitly allow debugging output for a given subsystem:
```
% sudo log config --mode "level:debug" \
  --subsystem com.phoenixtrap.perl
```
Et voilà!

Hmm, same lack of debugging messages.

I’m still figuring this out. Any clues? Drop me a line!

UPDATE: This is now fixed thanks to some inspiration from the source code of Log::Any::Adapter::Syslog. I’ve updated the code on Codeberg; here is the diff.

Bonus: Fancy output

Thanks to Log::Any::Proxy, you also get sprintf formatting variant functions:
```
use English;
$log->infof(
    'You are using Perl %s in %d',
    $PERL_VERSION, (localtime)[5] + 1900,
);
```
```
You are using Perl v5.40.2 in 2025
```
If you output an object that overloads string representation, you get that string:
```
use DateTime;
$log->infof('It is now %s', DateTime->now);
```
```
It is now 2025-08-10T20:16:50
```
And you get single-line Data::Dumper output of complex data structures, plus replacing undefined values with the string “undef”:
```
$log->info( {
    foo    => 'hello',
    bar    => 'world',
    colors => [ qw(
        red
        green
        blue
    ) ],
    null => undef,
} );
```
```
{bar => "world",colors => ["red","green","blue"],foo => "hello",null => undef}
```
Conclusion: Build once, use everywhere

The best tools aren’t always the ones you planned to build. They’re the ones that solve a problem cleanly–and then solve five more you hadn’t thought of yet.

What started as a quick fix for Mastodon media monitoring became a reusable bridge between Perl and macOS’ Unified Log. Along the way, I got to explore Apple’s logging internals, write an FFI-respecting C wrapper, and integrate cleanly with Log::Any. The resulting code is modular, auditable, and–most importantly–maintainable.

I didn’t set out to write a logging adapter. But when you care about clean ops and reproducible infrastructure, sometimes the best tools are the ones you build yourself. And if they happen to be over-engineered for the task at hand? All the better–they’ll probably outlive it.

Try it out or contribute!

The full adapter code is on Codeberg. If you’re logging from Perl on macOS, give it a spin. Contributions, bug reports, and real-world feedback are welcome–especially if you’re testing it in production or on older macOS versions.

I’ll do my best to stay compatible with past and future macOS and Perl releases. Keeping the code auditable and minimal should help it stay useful without becoming a moving target.
August 10, 2025
Lightweight object-oriented Perl scripts: From modulinos to moodulinos
Last week I found myself developing a Perl script to catalog some information for our quality assurance team. Unfortunately, as these things sometimes do, the script’s complexity and requirements started increasing. I still wanted to keep it as a simple script. Yet, it was growing command line arguments that needed extra validation. I also needed to test some functions without waiting for the entire script to run.

As with many things Perl, the basic solution is fairly old. Over twenty years ago, brian d foy popularized the modulino pattern. in which Perl scripts that you execute from the command line can also act as Perl modules. You can even use these modules in other contexts, for example testing.* A modulino seemed like the perfect solution for testing individual script functions, but writing object-oriented Perl outside of a framework (or the new Perl class syntax) can be challenging and verbose.

Enter the cow (Moo)

The Moo system of modules are billed as a lightweight way “to concisely define objects and roles with a convenient syntax that avoids the details of Perl’s object system.” It doesn’t have any XS code. Thus, it doesn’t need a C compiler to install. Unlike its inspiration, Moose, it’s optimized for the fast startup time needed for a command-line script. Sure, you don’t get a full-strength meta-object protocol for querying and manipulating classes, objects, and attributes—those capabilities are concerns for larger applications or libraries. In keeping with the lightweight theme, you can use Type::Tiny constraints for parameter validation. Additionally, there are several solutions for turning command-line arguments into object attributes. (I chose to use MooX::Options, mainly because of its easy availability as an Ubuntu Linux package.)

I’m not about to dump a proprietary script here on my blog. Yet, I have worked up an illustrative example of how to incorporate Moo into a modulino. Call it a “moodulino” if you like; here’s a short-ish script to tell Perl just how you feel at this time of day:
```
#!/usr/bin/env perl

use v5.38;

package moodulino;
use Moo;
use MooX::Options;
use Types::Standard qw(ArrayRef Str);

option name => (
    is       => 'ro',
    isa      => Str,
    required => 1,
    short    => 'n',
    doc      => 'your name here',
    format   => 's',
);

option moods => (
    is        => 'ro',
    isa       => ArrayRef [Str],
    predicate => 1,
    short     => 'm',
    doc       => 'a list of how you might feel',
    format    => 's@',
    autosplit => ',',
);

has time_of_day => (
    is      => 'ro',
    isa     => Str,
    builder => 1,
);

sub _build_time_of_day ($self) {
    my %hours = (
         5 => ‘morning’,
        12 => 'afternoon',
        17 => 'evening',
        21 => 'night',
    );

    for ( sort { $b <=> $a } keys %hours ) {
        return $hours{$_} if (localtime)[2] >= $_;
    }
    return 'night';
}

sub run ($self) {
    printf "Good %s, %s!\n",
      $self->time_of_day,
      $self->name;

    if ( $self->has_moods ) {
        say 'How are you feeling?';
        say "- $_?" for $self->moods->@*;
    }
}

package main;

main() unless caller;

sub main { moodulino->new_with_options->run() }
```
And here’s what happens when I run it:
```
% chmod a+x moodulino.pm
% ./moodulino.pm
name is missing
USAGE: moodulino.pm [-h] [long options ...]

    -m --moods=[Strings]  a list of how you might feel
    -n --name=String      your name here

    --usage               show a short help message
    -h                    show a compact help message
    --help                show a long help message
    --man                 show the manual
% ./moodulino.pm --name Mark
Good afternoon, Mark!
% ./moodulino.pm —name Mark --moods happy --moods sad --moods excited
Good afternoon, Mark!
How are you feeling?
- happy?
- sad?
- excited?
% ./moodulino.pm —name Mark --moods happy,sad,excited
Good afternoon, Mark!
How are you feeling?
- happy?
- sad?
- excited?
```
If the mood strikes, I can even write a test script for my script:
```
#!/usr/bin/env perl

use v5.38;
use Test2::V0;
use moodulino;

plan(3);

my $mood = moodulino->new( name => 'Bessy' );
isa_ok( $mood, 'moodulino' );
can_ok( $mood, 'time_of_day' );

is( $mood->time_of_day,
    in_set( qw(
        morning
        afternoon
        evening
        night
    ) ) );
```
And run it:
```
% prove -I. t/time_of_day.t
t/daytime.t .. ok
All tests successful.
Files=1, Tests=3,  0 wallclock secs ( 0.00 usr  0.00 sys +  0.07 cusr  0.01 csys =  0.08 CPU)
Result: PASS
```
* foy later expanded this idea into the chapter “Modules as Programs” in Mastering Perl (2007). You can also read more in his 2014 article “Rescue legacy code with modulinos”. Also explore Gábor Szabó’s articles on the topic. ↩︎
August 3, 2025

Tag: Perl

Scraping the Dragon with Perl and Mojolicious

From Chaos to Clarity in 40 lines

Laying the Groundwork: Tools for the Job

Remembering the Days Without Re-Parsing

Starting Where the App Starts

Collecting the Whole Timeline

Turning HTML into a Human-​Readable Schedule

The Little Routines That Make It All Work

The Schedule, Unlocked

Key Techniques

CSS selectors for precision

Memoization for efficiency

Unicode-​safe splitting

Functional chaining

Entity decoding at output

A Pattern You Can Take Anywhere

Even lighter Perl modulinos with Util::H2O::More

A simple script

Testing the waters

An ounce of script is worth a gallon of documentation

Kaiju Boss Battle: A Dist::Zilla Journey from Chaos to Co-Op

Why Dist::Zilla?

Why not something else?

Off to see the lizard!

LICENSE to kill (files)

Lessons learned, and what’s next?

Logging from Perl to macOS’ unified log with FFI and Log::Any

Part 1: The elephant in the room

Part 2: Feeding the Apple

Part 3: A platypus in the key of C

Part 4: Plugging into Log::Any

Part 5: At long last, logging… mostly

Bonus: Fancy output

Conclusion: Build once, use everywhere

Try it out or contribute!

Lightweight object-​oriented Perl scripts: From modulinos to moodulinos

Enter the cow (Moo)

Turning HTML into a Human-Readable Schedule

Unicode-safe splitting

Lightweight object-oriented Perl scripts: From modulinos to moodulinos