NAME
WordListRole::Bloom - Provide word_exists() that uses bloom filter
VERSION
This document describes version 0.007 of WordListRole::Bloom (from Perl
distribution WordListRole-Bloom), released on 2022-08-20.
SYNOPSIS
In your lib/WordList/EN/Foo.pm:
package WordList::EN::Foo;
use parent 'WordList';
use Role::Tiny::With;
with 'WordListRole::Bloom';
__DATA__
word1
word2
...
In your share/bloom, create your bloom filter data file, e.g. with
bloomgen:
% perl -ne 'print if (/^__DATA__$/ .. 0) && $i++' lib/WordList/EN/Foo.pm | \
bloomgen -n 1234 -p 0.1% > share/bloom
(where "-n" is set to the number of words, "-p" to the maximum
false-positive rate).
After that, in yourscript.pl:
my $wl = WordList::EN::Foo->new;
$wl->word_exists("foo"); # uses bloom filter to check for existence.
DESCRIPTION
This role provides an alternative "word_exists()" method that checks a
bloom filter located in the distribution share directory (share/bloom).
This provides a low startup-overhead way to check an item against a big
list (e.g. millions). Note that testing using a bloom filter can result
in a false positive (i.e. "word_exists()" returns true but the word is
not actually in the list.
PROVIDED METHODS
word_exists
HOMEPAGE
Please visit the project's homepage at
<https://metacpan.org/release/WordListRole-Bloom>.
SOURCE
Source repository is at
<https://github.com/perlancar/perl-WordListRole-Bloom>.
SEE ALSO
AUTHOR
perlancar <perlancar@cpan.org>
CONTRIBUTING
To contribute, you can send patches by email/via RT, or send pull
requests on GitHub.
Most of the time, you don't need to build the distribution yourself. You
can simply modify the code, then test via:
% prove -l
If you want to build the distribution (e.g. to try to install it locally
on your system), you can install Dist::Zilla,
Dist::Zilla::PluginBundle::Author::PERLANCAR,
Pod::Weaver::PluginBundle::Author::PERLANCAR, and sometimes one or two
other Dist::Zilla- and/or Pod::Weaver plugins. Any additional steps
required beyond that are considered a bug and can be reported to me.
COPYRIGHT AND LICENSE
This software is copyright (c) 2022, 2020 by perlancar
<perlancar@cpan.org>.
This is free software; you can redistribute it and/or modify it under
the same terms as the Perl 5 programming language system itself.
BUGS
Please report any bugs or feature requests on the bugtracker website
<https://rt.cpan.org/Public/Dist/Display.html?Name=WordListRole-Bloom>
When submitting a bug or request, please include a test-file or a patch
to an existing test-file that illustrates the bug or desired feature.