# NAME Database::Abstraction - read-only database abstraction layer # VERSION Version 0.13 # SYNOPSIS Abstract class giving read-only access to CSV, XML and SQLite databases via Perl without writing any SQL, using caching for performance optimization. It offers functionalities like opening the database and fetching data based on various criteria, Look for databases in $directory in this order: 1) SQLite (file ends with .sql) 2) PSV (pipe separated file, file ends with .psv) 3) CSV (file ends with .csv or .db, can be gzipped) (note the default sep\_char is '!' not ',') 4) XML (file ends with .xml) The AUTOLOAD feature allows for convenient access to database columns using method calls. If the table has a key column, entries are keyed on that and sorts are based on it. To turn that off, pass 'no\_entry' to the constructor, for legacy reasons it's enabled by default. The key column's default name is 'entry', but it can be overridden by the 'id' parameter. CSV files that are not no\_entry can have empty lines or comment lines starting with '#', to make them more readable. For example, you can access the files in /var/db/foo.csv via this class: package MyPackageName::Database::Foo; use Database::Abstraction; our @ISA = ('Database::Abstraction'); # Regular CSV: There is no entry column and the separators are commas sub new { my $class = shift; my %args = (ref($_[0]) eq 'HASH') ? %{$_[0]} : @_; return $class->SUPER::new(no_entry => 1, sep_char => ',', %args); } You can then access the data using: my $foo = MyPackageName::Database::Foo->new(directory => '/var/dat'); print 'Customer name ', $foo->name(customer_id => 'plugh'), "\n"; my $row = $foo->fetchrow_hashref(customer_id => 'xyzzy'); print Data::Dumper->new([$row])->Dump(); # SUBROUTINES/METHODS ## init Initializes the abstraction class and its subclasses with optional arguments for configuration. Database::Abstraction::init(directory => '../data'); See the documentation for new to see what variables can be set. Returns a reference to a hash of the current values. Therefore when given with no arguments you can get the current default values: my $defaults = Database::Abstraction::init(); print $defaults->{'directory'}, "\n"; ## new Create an object to point to a read-only database. Arguments: Takes different argument formats (hash or positional) cache => place to store results; cache\_duration => how long to store results in the cache (default is 1 hour); directory => where the database file is held max\_slurp\_size => CSV/PSV/XML files smaller than this are held in RAM (default is 16K) If the arguments are not set, tries to take from class level defaults. Checks for abstract class usage. Slurp mode assumes that the key column (entry) is unique. If it isn't, searches will be incomplete. Turn off slurp mode on those databases, by setting a low value for max\_slurp\_size. Clones existing objects with or without modifications. Uses Carp::carp to log warnings for incorrect usage or potential mistakes. ## set\_logger Sets class or code reference that will be used for logging. ## selectall\_hashref Returns a reference to an array of hash references of all the data meeting the given criteria. Note that since this returns an array ref, optimisations such as "LIMIT 1" will not be used. Use caching if that is available. ## selectall\_hash Similar to selectall\_hashref but returns an array of hash references. ## fetchrow\_hashref Returns a hash reference for a single row in a table. Special argument: table: determines the table to read from if not the default, which is worked out from the class name ## execute Execute the given SQL query on the database. In an array context, returns an array of hash refs, in a scalar context returns a hash of the first row On CSV tables without no\_entry, it may help to add "WHERE entry IS NOT NULL AND entry NOT LIKE '#%'" to the query. If the data have been slurped, this will still work by accessing that actual database. ## updated Returns the timestamp of the last database update. ## AUTOLOAD Directly access a database column. Returns all entries in a column, a single entry based on criteria. Uses cached data if available. Returns an array of the matches, or only the first when called in scalar context If the database has a column called "entry" you can do a quick lookup with my $value = $foo->column('123'); # where "column" is the value you're after my @entries = $foo->entry(); print 'There are ', scalar(@entries), " entries in the database\n"; Set distinct or unique to 1 if you're after a unique list. # AUTHOR Nigel Horne, `` # BUGS The default delimiter for CSV files is set to '!', not ',' for historical reasons. I really ought to fix that. It would be nice for the key column to be called key, not entry, however key's a reserved word in SQL. The no\_entry parameter should be no\_id. XML slurping is hard, so if XML fails for you on a small file force non-slurping mode with $foo = MyPackageName::Database::Foo->new({ directory => '/var/db', max_slurp_size => 1 # force to not use slurp and therefore to use SQL }); # LICENSE AND COPYRIGHT Copyright 2015-2024 Nigel Horne. This program is released under the following licence: GPL2. Usage is subject to licence terms. The licence terms of this software are as follows: Personal single user, single computer use: GPL2 All other users (for example Commercial, Charity, Educational, Government) must apply in writing for a licence for use from Nigel Horne at the above e-mail.