Overengineering Title Preferences for VNDB

(Published on 2022-12-23)

Alternative title: Fighting with the PostgreSQL query planner.

What people want me to work on: cool new features and user interface updates for VNDB.
What I actually work on: rewriting existing features and silly microbenchmarks.

In this post I elaborate on the technical implementation of one particular site feature and my attempts to improve on it.

Context: Title Preferences

The VNDB schema has these two tables:

CREATE TABLE vn (
    id    vndbid   PRIMARY KEY,
    olang language NOT NULL, -- "original language"
    -- and more
);

CREATE TABLE vn_titles (
    id       vndbid   NOT NULL, -- refers to vn.id
    lang     language NOT NULL,
    official boolean  NOT NULL,
    title    text     NOT NULL,
    latin    text,
    PRIMARY KEY(id, lang)
);

That is to say, we’ve got visual novel (vn) entries, and each entry can have a bunch of titles, one for each language it has been published in. The vn.olang column tells us which language the VN has originally been published in and hence which title to consider as primary. Titles can be unofficial and titles written in a non-latin script (CJK, Cyrillic, etc) also have a transcription to the latin alphabet.

When a VN entry is displayed on the website, the latin title of the original language (or just the title if latin is NULL) is used as the main title, and, when applicable, the title in the original script is used as an alternative title. The alternative title is often displayed below the main title or used as HTML “title” attribute in listings and links. To take G-senjou no Maou as an example:

Annotated screenshot of the VNDB G-senjou no Maou page

People who can read Japanese will generally not care for that latin transcription, while other people may prefer to see an English title when one is available. Hence, VNDB lets users configure which titles to display. These settings are pretty flexible: users can list multiple languages, assign a priority to each and indicate whether to accept only original or official titles. The selection algorithm for the main and alternative titles can be configured separately. Here’s an elaborate example.

More Context: Current Implementation

To ease the migration from the old situation where a VN entry only had a single title, I implemented this feature in SQL by (ab)using VIEWs.

The database schema now has the following view, which acts as an alias to the vn table and adds a title and alttitle column:

CREATE VIEW vnt AS
  SELECT v.*
       , COALESCE(vo.latin, vo.title) AS title
       , CASE WHEN vo.latin IS NULL THEN '' ELSE vo.title END AS alttitle
    FROM vn v
    JOIN vn_titles vo ON vo.id = v.id AND vo.lang = v.olang;

Adjusting the code to select from this view instead of the vn table was trivial and the performance is perfectly fine. With that view in place, there’s a simple path towards supporting title preferences: just temporarily replace that view with a different query that implements the user’s preferred title selection. This is made even easier because, in PostgreSQL, temporary views are defined inside a special session-local schema and that schema is searched before the others. So all that is needed to switch to a configuration where English is preferred for the main title is the following query¹.

CREATE OR REPLACE TEMPORARY VIEW vnt AS
  SELECT v.*
       , COALESCE(ve.title, vo.latin, vo.title) AS title
       , CASE WHEN vo.latin IS NULL THEN '' ELSE vo.title END AS alttitle
    FROM vn v
    JOIN vn_titles vo ON vo.id = v.id AND vo.lang = v.olang
    LEFT JOIN vn_titles ve ON ve.id = v.id AND ve.lang = 'en';

While abusing views for configuration is perhaps a bit weird, this particular solution was motivated by two other constraints:

“If you’re generating the SQL from the application anyway, couldn’t you inline the preferences as a subquery rather than using a view?” That would be a nicer solution, but there’s a problem. I also have this item_info() support function that takes an identifier of any database object - including VNs - and returns some generic information about that object - including the title. That title should also take preferences into account, so I either need to have a way to pass the preferences as a function argument or… simply have it select from a VIEW. The item_info() function is also too useful to just let go.

“How about just passing all available titles to the application and implement the preferences there?” Would not be a terrible solution either, except that I also need a way to sort large-ish lists on the displayed title. I’d rather not do that in the application as performance would be a nightmare.

That is not to say the current abuse of views is necessarily a great solution either, it comes with two main drawbacks:

The CREATE VIEW command isn’t super slow, but also not instant. It has to be run at the start of each pageview (when switching from a previous configuration, at least), and we’ll need to issue a command for each database type that supports title preferences. Currently that’s only two (VNs and releases), but I’d like to increase that number and that will further increase page generation times even for pages that don’t display any titles at all.
There are some scenarios where we need to query an object title (through item_info() or other means) without applying the user’s preferences, but switching preferences for each query is much too slow. My current workaround is to remove the temporary schema from the search_path so that the temporary view is not visible to the query being run, but that’s a pretty shitty hack.

A Little Experiment

In search for a better solution, I was wondering if I could implement the title selection entirely within SQL. That is, in such a way that the users’ preferences are passed around as some value and that the query to fetch the correct title is then generated within SQL. PostgreSQL supports writing functions that can be used in the context of a table or view, and I knew it could, if stars aligned correctly, also optimize queries through function boundaries. My initial attempt to write such a function looked like this:

CREATE OR REPLACE FUNCTION vnt(p langprefs) RETURNS SETOF vnt_type AS $$
  SELECT v.*
     , coalesce(
         CASE WHEN p.t1_latin THEN t1.latin ELSE NULL END, t1.title,
         CASE WHEN p.t2_latin THEN t2.latin ELSE NULL END, t2.title,
         CASE WHEN p.t3_latin THEN t3.latin ELSE NULL END, t3.title,
         CASE WHEN p.t4_latin THEN t4.latin ELSE NULL END, t4.title,
         CASE WHEN p.t5_latin THEN t5.latin ELSE NULL END, t5.title
       ) AS title
     , coalesce(
         CASE WHEN p.a1_latin THEN a1.latin ELSE NULL END, a1.title,
         CASE WHEN p.a2_latin THEN a2.latin ELSE NULL END, a2.title,
         CASE WHEN p.a3_latin THEN a3.latin ELSE NULL END, a3.title,
         CASE WHEN p.a4_latin THEN a4.latin ELSE NULL END, a4.title,
         CASE WHEN p.a5_latin THEN a5.latin ELSE NULL END, a5.title
       ) AS alttitle
    FROM vn v
    LEFT JOIN vn_titles t1 ON t1.id = v.id AND t1.lang = COALESCE(p.t1_lang, v.olang) AND (NOT p.t1_official OR t1.official)
    LEFT JOIN vn_titles t2 ON t2.id = v.id AND t2.lang = COALESCE(p.t2_lang, v.olang) AND (NOT p.t2_official OR t2.official) AND p.t1_lang IS NOT NULL
    LEFT JOIN vn_titles t3 ON t3.id = v.id AND t3.lang = COALESCE(p.t3_lang, v.olang) AND (NOT p.t3_official OR t3.official) AND p.t1_lang IS NOT NULL AND p.t2_lang IS NOT NULL
    LEFT JOIN vn_titles t4 ON t4.id = v.id AND t4.lang = COALESCE(p.t4_lang, v.olang) AND (NOT p.t4_official OR t4.official) AND p.t1_lang IS NOT NULL AND p.t2_lang IS NOT NULL AND p.t3_lang IS NOT NULL
    LEFT JOIN vn_titles t5 ON t5.id = v.id AND t5.lang = COALESCE(p.t5_lang, v.olang) AND (NOT p.t5_official OR t5.official) AND p.t1_lang IS NOT NULL AND p.t2_lang IS NOT NULL AND p.t3_lang IS NOT NULL AND p.t4_lang IS NOT NULL
    LEFT JOIN vn_titles a1 ON a1.id = v.id AND a1.lang = COALESCE(p.a1_lang, v.olang) AND (NOT p.a1_official OR a1.official)
    LEFT JOIN vn_titles a2 ON a2.id = v.id AND a2.lang = COALESCE(p.a2_lang, v.olang) AND (NOT p.a2_official OR a2.official) AND p.a1_lang IS NOT NULL
    LEFT JOIN vn_titles a3 ON a3.id = v.id AND a3.lang = COALESCE(p.a3_lang, v.olang) AND (NOT p.a3_official OR a3.official) AND p.a1_lang IS NOT NULL AND p.a2_lang IS NOT NULL
    LEFT JOIN vn_titles a4 ON a4.id = v.id AND a4.lang = COALESCE(p.a4_lang, v.olang) AND (NOT p.a4_official OR a4.official) AND p.a1_lang IS NOT NULL AND p.a2_lang IS NOT NULL AND p.a3_lang IS NOT NULL
    LEFT JOIN vn_titles a5 ON a5.id = v.id AND a5.lang = COALESCE(p.a5_lang, v.olang) AND (NOT p.a5_official OR a5.official) AND p.a1_lang IS NOT NULL AND p.a2_lang IS NOT NULL AND p.a3_lang IS NOT NULL AND p.a4_lang IS NOT NULL
$$ LANGUAGE SQL STABLE;

Not the most elegant function ever written and also not fully correct, but it serves as a starting point. It can be used in place of the vnt view:

SELECT id, title, alttitle
  FROM vnt('(en,,,,,,,,,,t,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f)'::langprefs) x
 WHERE NOT hidden AND id < 'v10';

The configuration I’m using here is the same as the English-main-title example used above. To my surprise, the corresponding query plan isn’t even awful:

                                                                                     QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
 Nested Loop Left Join  (cost=655.67..4101.26 rows=195 width=68) (actual time=0.053..3.255 rows=9 loops=1)
   Join Filter: false
   ->  Nested Loop Left Join  (cost=655.67..4099.31 rows=195 width=315) (actual time=0.052..3.252 rows=9 loops=1)
         Join Filter: false
         ->  Nested Loop Left Join  (cost=655.67..4097.36 rows=195 width=284) (actual time=0.052..3.250 rows=9 loops=1)
               Join Filter: false
               ->  Nested Loop Left Join  (cost=655.67..4095.41 rows=195 width=253) (actual time=0.052..3.249 rows=9 loops=1)
                     Join Filter: false
                     ->  Nested Loop Left Join  (cost=655.67..4093.46 rows=195 width=222) (actual time=0.052..3.247 rows=9 loops=1)
                           ->  Nested Loop Left Join  (cost=655.38..2933.01 rows=195 width=195) (actual time=0.049..3.236 rows=9 loops=1)
                                 Join Filter: false
                                 ->  Nested Loop Left Join  (cost=655.38..2931.06 rows=195 width=164) (actual time=0.049..3.235 rows=9 loops=1)
                                       Join Filter: false
                                       ->  Nested Loop Left Join  (cost=655.38..2929.11 rows=195 width=133) (actual time=0.048..3.233 rows=9 loops=1)
                                             Join Filter: false
                                             ->  Nested Loop Left Join  (cost=655.38..2927.16 rows=195 width=102) (actual time=0.048..3.231 rows=9 loops=1)
                                                   ->  Hash Right Join  (cost=655.09..1766.71 rows=195 width=71) (actual time=0.042..3.215 rows=9 loops=1)
                                                         Hash Cond: (t1.id = v.id)
                                                         ->  Seq Scan on vn_titles t1  (cost=0.00..1081.40 rows=11514 width=67) (actual time=0.002..2.689 rows=11535 loops=1)
                                                               Filter: (lang = 'en'::language)
                                                               Rows Removed by Filter: 33617
                                                         ->  Hash  (cost=652.65..652.65 rows=195 width=8) (actual time=0.018..0.019 rows=9 loops=1)
                                                               Buckets: 1024  Batches: 1  Memory Usage: 9kB
                                                               ->  Bitmap Heap Scan on vn v  (cost=5.86..652.65 rows=195 width=8) (actual time=0.008..0.016 rows=9 loops=1)
                                                                     Recheck Cond: (id < 'v10'::vndbid)
                                                                     Filter: (NOT hidden)
                                                                     Heap Blocks: exact=9
                                                                     ->  Bitmap Index Scan on vn_pkey  (cost=0.00..5.81 rows=203 width=0) (actual time=0.003..0.003 rows=9 loops=1)
                                                                           Index Cond: (id < 'v10'::vndbid)
                                                   ->  Index Scan using vn_titles_pkey on vn_titles t2  (cost=0.29..5.95 rows=1 width=39) (actual time=0.001..0.001 rows=1 loops=9)
                                                         Index Cond: ((id = v.id) AND (lang = COALESCE(v.olang)))
                                             ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=9)
                                                   One-Time Filter: false
                                       ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=9)
                                             One-Time Filter: false
                                 ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=9)
                                       One-Time Filter: false
                           ->  Index Scan using vn_titles_pkey on vn_titles a1  (cost=0.29..5.95 rows=1 width=39) (actual time=0.001..0.001 rows=1 loops=9)
                                 Index Cond: ((id = v.id) AND (lang = COALESCE(v.olang)))
                     ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=9)
                           One-Time Filter: false
               ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=9)
                     One-Time Filter: false
         ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=9)
               One-Time Filter: false
   ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=9)
         One-Time Filter: false
 Planning Time: 1.558 ms
 Execution Time: 3.329 ms

I mean, it’s neither the smallest nor the best² plan, but:

The big ugly query inside the vnt() function was properly “inlined” into the SELECT statement where it was used, which allowed the query planner to move the conditions from the WHERE clause to the point where rows from the vn table are being read, which in turn allows it to use an index. Without this optimization, that simple query would have had to read all titles from all VN entries from disk, only to then throw almost everything away again.
There’s a lot of joins, but the planner had correctly inferred that they’ll never match anything and has optimized those out with a “One-Time Filter: false”.

Honestly, that’s pretty awesome. Let’s try sorting on the title.

SELECT id, title, alttitle
  FROM vnt('(en,,,,,,,,,,t,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f,f)'::langprefs) x
 WHERE NOT hidden
 ORDER BY title LIMIT 100

                                                                                           QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=13010.96..13011.21 rows=100 width=68) (actual time=108.074..108.085 rows=100 loops=1)
   ->  Sort  (cost=13010.96..13108.95 rows=39196 width=68) (actual time=108.073..108.079 rows=100 loops=1)
         Sort Key: (COALESCE(t1.latin, t1.title, t2.title, title, title, title))
         Sort Method: top-N heapsort  Memory: 44kB
         ->  Nested Loop Left Join  (cost=4516.79..11512.92 rows=39196 width=68) (actual time=19.879..88.257 rows=39128 loops=1)
               Join Filter: false
               ->  Nested Loop Left Join  (cost=4516.79..11120.96 rows=39196 width=315) (actual time=19.878..82.548 rows=39128 loops=1)
                     Join Filter: false
                     ->  Nested Loop Left Join  (cost=4516.79..10729.00 rows=39196 width=284) (actual time=19.878..77.113 rows=39128 loops=1)
                           Join Filter: false
                           ->  Nested Loop Left Join  (cost=4516.79..10337.04 rows=39196 width=253) (actual time=19.877..71.686 rows=39128 loops=1)
                                 Join Filter: false
                                 ->  Hash Left Join  (cost=4516.79..9945.08 rows=39196 width=222) (actual time=19.877..66.404 rows=39128 loops=1)
                                       Hash Cond: ((v.id = a1.id) AND (COALESCE(v.olang) = a1.lang))
                                       ->  Nested Loop Left Join  (cost=2870.99..8093.50 rows=39196 width=195) (actual time=11.921..50.437 rows=39128 loops=1)
                                             Join Filter: false
                                             ->  Nested Loop Left Join  (cost=2870.99..7701.54 rows=39196 width=164) (actual time=11.920..45.003 rows=39128 loops=1)
                                                   Join Filter: false
                                                   ->  Nested Loop Left Join  (cost=2870.99..7309.58 rows=39196 width=133) (actual time=11.920..39.607 rows=39128 loops=1)
                                                         Join Filter: false
                                                         ->  Hash Left Join  (cost=2870.99..6917.62 rows=39196 width=102) (actual time=11.919..33.959 rows=39128 loops=1)
                                                               Hash Cond: ((v.id = t2.id) AND (COALESCE(v.olang) = t2.lang))
                                                               ->  Hash Left Join  (cost=1225.19..5066.04 rows=39196 width=71) (actual time=3.964..18.276 rows=39128 loops=1)
                                                                     Hash Cond: (v.id = t1.id)
                                                                     ->  Seq Scan on vn v  (cost=0.00..3737.95 rows=39196 width=8) (actual time=0.003..9.290 rows=39128 loops=1)
                                                                           Filter: (NOT hidden)
                                                                           Rows Removed by Filter: 1667
                                                                     ->  Hash  (cost=1081.40..1081.40 rows=11503 width=67) (actual time=3.958..3.958 rows=11535 loops=1)
                                                                           Buckets: 16384  Batches: 1  Memory Usage: 783kB
                                                                           ->  Seq Scan on vn_titles t1  (cost=0.00..1081.40 rows=11503 width=67) (actual time=0.002..2.924 rows=11535 loops=1)
                                                                                 Filter: (lang = 'en'::language)
                                                                                 Rows Removed by Filter: 33617
                                                               ->  Hash  (cost=968.52..968.52 rows=45152 width=39) (actual time=7.933..7.934 rows=45152 loops=1)
                                                                     Buckets: 65536  Batches: 1  Memory Usage: 3749kB
                                                                     ->  Seq Scan on vn_titles t2  (cost=0.00..968.52 rows=45152 width=39) (actual time=0.001..3.368 rows=45152 loops=1)
                                                         ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=39128)
                                                               One-Time Filter: false
                                                   ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=39128)
                                                         One-Time Filter: false
                                             ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=39128)
                                                   One-Time Filter: false
                                       ->  Hash  (cost=968.52..968.52 rows=45152 width=39) (actual time=7.933..7.933 rows=45152 loops=1)
                                             Buckets: 65536  Batches: 1  Memory Usage: 3749kB
                                             ->  Seq Scan on vn_titles a1  (cost=0.00..968.52 rows=45152 width=39) (actual time=0.001..3.385 rows=45152 loops=1)
                                 ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=39128)
                                       One-Time Filter: false
                           ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=39128)
                                 One-Time Filter: false
                     ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=39128)
                           One-Time Filter: false
               ->  Result  (cost=0.00..0.00 rows=0 width=31) (actual time=0.000..0.000 rows=0 loops=39128)
                     One-Time Filter: false
 Planning Time: 0.768 ms
 Execution Time: 108.133 ms

Hmm, wait, all those joins that I had thought the planner had nicely optimized away apparently still take up 5ms+ of processing time each. That’s… not ideal. Let’s look at the vnt view solution for comparison:

                                                                   QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=8415.66..8415.91 rows=100 width=68) (actual time=37.743..37.752 rows=100 loops=1)
   ->  Sort  (cost=8415.66..8513.65 rows=39196 width=68) (actual time=37.742..37.746 rows=100 loops=1)
         Sort Key: (COALESCE(ve.title, vo.latin, vo.title))
         Sort Method: top-N heapsort  Memory: 42kB
         ->  Hash Join  (cost=2870.99..6917.62 rows=39196 width=68) (actual time=12.355..31.546 rows=39128 loops=1)
               Hash Cond: ((v.id = vo.id) AND (v.olang = vo.lang))
               ->  Hash Left Join  (cost=1225.19..5066.04 rows=39196 width=39) (actual time=3.694..16.147 rows=39128 loops=1)
                     Hash Cond: (v.id = ve.id)
                     ->  Seq Scan on vn v  (cost=0.00..3737.95 rows=39196 width=8) (actual time=0.002..8.232 rows=39128 loops=1)
                           Filter: (NOT hidden)
                           Rows Removed by Filter: 1667
                     ->  Hash  (cost=1081.40..1081.40 rows=11503 width=35) (actual time=3.685..3.686 rows=11535 loops=1)
                           Buckets: 16384  Batches: 1  Memory Usage: 782kB
                           ->  Seq Scan on vn_titles ve  (cost=0.00..1081.40 rows=11503 width=35) (actual time=0.001..2.598 rows=11535 loops=1)
                                 Filter: (lang = 'en'::language)
                                 Rows Removed by Filter: 33617
               ->  Hash  (cost=968.52..968.52 rows=45152 width=71) (actual time=8.652..8.652 rows=45152 loops=1)
                     Buckets: 65536  Batches: 1  Memory Usage: 4665kB
                     ->  Seq Scan on vn_titles vo  (cost=0.00..968.52 rows=45152 width=71) (actual time=0.003..3.676 rows=45152 loops=1)
 Planning Time: 0.217 ms
 Execution Time: 37.778 ms

Ouch, much faster. One reason it’s faster is because there’s only a single join to fetch the title for the original language, rather than two in the vnt() function. That can be worked around by special-casing a join on the original language, but the other reason it’s faster is that all those silly unused joins don’t exist at all, and I hadn’t a clue how to fix that.

(Full script ⓘ of this experiment for those who want to follow along, it should work on the public database dumps with minimal modifications)

A Little Detour: Custom Aggregate

So my mind wandered off to try another approach. What if we simply fetch all the relevant titles for the VN entry and throw them through a custom aggregate function that selects the appropriate title? So I gave that a try:

CREATE TYPE vntitle_state AS (
    rank smallint,
    title text
);

CREATE OR REPLACE FUNCTION vntitle_sfunc(state vntitle_state, t vn_titles, olang language, p langprefs) RETURNS vntitle_state AS $$
BEGIN
    IF state.rank > 1 AND COALESCE(p.t1_lang, olang) = t.lang AND (NOT p.t1_official OR t.official) THEN
        RETURN ROW(1::smallint, COALESCE(CASE WHEN p.t1_latin THEN t.latin ELSE NULL END, t.title));
    ELSIF state.rank > 2 AND COALESCE(p.t2_lang, olang) = t.lang AND (NOT p.t2_official OR t.official) THEN
        RETURN ROW(2::smallint, COALESCE(CASE WHEN p.t2_latin THEN t.latin ELSE NULL END, t.title));
    ELSIF state.rank > 3 AND COALESCE(p.t3_lang, olang) = t.lang AND (NOT p.t3_official OR t.official) THEN
        RETURN ROW(3::smallint, COALESCE(CASE WHEN p.t3_latin THEN t.latin ELSE NULL END, t.title));
    ELSIF state.rank > 4 AND COALESCE(p.t4_lang, olang) = t.lang AND (NOT p.t4_official OR t.official) THEN
        RETURN ROW(4::smallint, COALESCE(CASE WHEN p.t4_latin THEN t.latin ELSE NULL END, t.title));
    ELSIF state.rank > 5 AND COALESCE(p.t5_lang, olang) = t.lang AND (NOT p.t5_official OR t.official) THEN
        RETURN ROW(5::smallint, COALESCE(CASE WHEN p.t5_latin THEN t.latin ELSE NULL END, t.title));
    END IF;
    RETURN state;
END;
$$ LANGUAGE plpgsql IMMUTABLE;

CREATE OR REPLACE AGGREGATE vntitle(vn_titles, language, langprefs) (
    SFUNC = vntitle_sfunc,
    STYPE = vntitle_state,
    INITCOND = '(6,)'
);

Requires rewriting our SELECT query a bit, but it’s still usable:

SELECT v.id, vntitle(t, v.olang, '(en,,,,,f,f,f,f,f,f,f,f,f,f)'::langprefs) title
  FROM vn v
  JOIN vn_titles t ON t.id = v.id
 WHERE NOT hidden AND v.id < 'v10'
 GROUP BY v.id;

Uh, that simple example already takes 10ms, making it slower than the vnt() approach. It’s even worse if we want to sort on the title: 114ms. I replaced my custom aggregate with the built-in max() to see if it was just my custom function being slow, but even then we’re at 4.2ms and 52ms, both slower than the original vnt view.

Nope, bad idea (but I kept the full script ⓘ anyway).

Another Little Detour: SupportRequestSimplify

One of the neat things about PostgreSQL is that it exposes much of its internal workings through a C extension API. If you’re running into the limitations of SQL, there’s always the option of dropping into C. The C API exposes a planner support function, which, among other things, has a “SupportRequestSimplify” operation that lets you rewrite a function call into an arbitrary SQL expression tree at planning time.

So what I could do, I thought, is write a C support function that rewrites calls to vnt() with the appropriate optimized SQL statements; thus not emitting any unnecessary JOINs in the first place.

I did write a little function to verify that the SupportRequestSimplify operation is being called for functions in table context - it is - and I looked into having it generate and return a proper table expression, but browsing through the PostgreSQL source code I could not find a single instance of the simplify operation being used in such a way. It only seems to be used (and intended) for simplifying certain types of scalar expressions, so I’m not sure to what extend it’s possible to have it construct complex SQL statements.

Before I was able to dig deeper and really give it a try, I came to my senses and realized that I don’t want to write and maintain complex C code generating custom query trees for each type of database object. I haven’t gone quite that mad… yet.

Back to A Little Experiment

While I was busying myself with the above detours, I kept thinking about ways to optimize the earlier vnt() function. What if, instead of trying to convince the planner that those unused JOINs didn’t return any rows, I could convince it that I didn’t care about the join by not referencing any of its columns, instead? Turns out that works, if the conditions are right. One of those conditions is that the ON filter of the JOIN must exactly match a unique index (or primary key) of the joined table, which is (id,lang) in our case. Adding any more filters causes the optimization to fail, so part of the title selection logic had to move to the SELECT part of the query. Apparently the join filter may not be NULL, either.

With that trick applied, I also implemented proper support for selecting titles that must match the original language and implemented the original language as a special-case so that the same JOIN can be used for both the main and alternative title. The end result isn’t any more elegant:

CREATE OR REPLACE FUNCTION vnt(p langprefs) RETURNS SETOF vnt_type AS $$
  SELECT v.*
       -- The language selection logic below is specially written so that the planner can remove references to joined tables corresponding to NULL languages.
     , COALESCE(
         CASE WHEN p.t1_lang IS NULL OR (p.t1_official AND NOT t1.official) OR (p.t1_official IS NULL AND t1.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.t1_latin THEN t1.latin ELSE NULL END, t1.title) END,
         CASE WHEN p.t2_lang IS NULL OR (p.t2_official AND NOT t2.official) OR (p.t2_official IS NULL AND t2.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.t2_latin THEN t2.latin ELSE NULL END, t2.title) END,
         CASE WHEN p.t3_lang IS NULL OR (p.t3_official AND NOT t3.official) OR (p.t3_official IS NULL AND t3.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.t3_latin THEN t3.latin ELSE NULL END, t3.title) END,
         CASE WHEN p.t4_lang IS NULL OR (p.t4_official AND NOT t4.official) OR (p.t4_official IS NULL AND t4.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.t4_latin THEN t4.latin ELSE NULL END, t4.title) END,
         CASE WHEN p IS NULL OR p.to_latin THEN ol.latin ELSE NULL END, ol.title
       ) AS title
     , COALESCE(
         CASE WHEN p.a1_lang IS NULL OR (p.a1_official AND NOT a1.official) OR (p.a1_official IS NULL AND a1.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.a1_latin THEN a1.latin ELSE NULL END, a1.title) END,
         CASE WHEN p.a2_lang IS NULL OR (p.a2_official AND NOT a2.official) OR (p.a2_official IS NULL AND a2.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.a2_latin THEN a2.latin ELSE NULL END, a2.title) END,
         CASE WHEN p.a3_lang IS NULL OR (p.a3_official AND NOT a3.official) OR (p.a3_official IS NULL AND a3.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.a3_latin THEN a3.latin ELSE NULL END, a3.title) END,
         CASE WHEN p.a4_lang IS NULL OR (p.a4_official AND NOT a4.official) OR (p.a4_official IS NULL AND a4.lang <> v.olang) THEN NULL ELSE COALESCE(CASE WHEN p.a4_latin THEN a4.latin ELSE NULL END, a4.title) END,
         CASE WHEN p.ao_latin THEN ol.latin ELSE NULL END, ol.title
       ) AS alttitle
    FROM vn v
    JOIN vn_titles ol ON ol.id = v.id AND ol.lang = v.olang
    -- The COALESCE() below is kind of meaningless, but apparently the query planner can't optimize out JOINs with NULL conditions.
    LEFT JOIN vn_titles t1 ON t1.id = v.id AND t1.lang = COALESCE(p.t1_lang, 'en')
    LEFT JOIN vn_titles t2 ON t2.id = v.id AND t2.lang = COALESCE(p.t2_lang, 'en')
    LEFT JOIN vn_titles t3 ON t3.id = v.id AND t3.lang = COALESCE(p.t3_lang, 'en')
    LEFT JOIN vn_titles t4 ON t4.id = v.id AND t4.lang = COALESCE(p.t4_lang, 'en')
    LEFT JOIN vn_titles a1 ON a1.id = v.id AND a1.lang = COALESCE(p.a1_lang, 'en')
    LEFT JOIN vn_titles a2 ON a2.id = v.id AND a2.lang = COALESCE(p.a2_lang, 'en')
    LEFT JOIN vn_titles a3 ON a3.id = v.id AND a3.lang = COALESCE(p.a3_lang, 'en')
    LEFT JOIN vn_titles a4 ON a4.id = v.id AND a4.lang = COALESCE(p.a4_lang, 'en')
$$ LANGUAGE SQL STABLE;

Let’s try the sort-by-title benchmark again:

                                                                               QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=8513.65..8513.90 rows=100 width=68) (actual time=39.216..39.226 rows=100 loops=1)
   ->  Sort  (cost=8513.65..8611.64 rows=39196 width=68) (actual time=39.216..39.220 rows=100 loops=1)
         Sort Key: (COALESCE(CASE WHEN (((NOT t1.official) AND NULL::boolean) OR (t1.lang <> v.olang)) THEN NULL::text ELSE COALESCE(t1.title) END, ol.latin, ol.title))
         Sort Method: top-N heapsort  Memory: 43kB
         ->  Hash Join  (cost=2870.99..7015.61 rows=39196 width=68) (actual time=12.583..32.991 rows=39128 loops=1)
               Hash Cond: ((v.id = ol.id) AND (v.olang = ol.lang))
               ->  Hash Left Join  (cost=1225.19..5066.04 rows=39196 width=44) (actual time=3.841..16.826 rows=39128 loops=1)
                     Hash Cond: (v.id = t1.id)
                     ->  Seq Scan on vn v  (cost=0.00..3737.95 rows=39196 width=8) (actual time=0.002..8.535 rows=39128 loops=1)
                           Filter: (NOT hidden)
                           Rows Removed by Filter: 1667
                     ->  Hash  (cost=1081.40..1081.40 rows=11503 width=40) (actual time=3.833..3.834 rows=11535 loops=1)
                           Buckets: 16384  Batches: 1  Memory Usage: 873kB
                           ->  Seq Scan on vn_titles t1  (cost=0.00..1081.40 rows=11503 width=40) (actual time=0.001..2.622 rows=11535 loops=1)
                                 Filter: (lang = 'en'::language)
                                 Rows Removed by Filter: 33617
               ->  Hash  (cost=968.52..968.52 rows=45152 width=71) (actual time=8.731..8.732 rows=45152 loops=1)
                     Buckets: 65536  Batches: 1  Memory Usage: 4665kB
                     ->  Seq Scan on vn_titles ol  (cost=0.00..968.52 rows=45152 width=71) (actual time=0.003..3.744 rows=45152 loops=1)
 Planning Time: 0.545 ms
 Execution Time: 39.252 ms

Hey, that’s the exact same query plan that we got with the original vnt view!

SUCCESS! \o/

A Failed Little Experiment?

…or so I thought. But then I went ahead an implemented that function to a few parts of the site and quickly ran into queries that ran quite a bit slower than they used to. Turns out the query planner gives up on finding the most optimal plan pretty soon once the queries start getting slightly more complex. Taking, for example, the query to fetch the 4 random screenshots on the homepage:

SELECT i.id, i.width, i.height, i.id AS vid, v.title
  FROM (SELECT id, width, height FROM images i TABLESAMPLE SYSTEM (0.93)
         WHERE i.c_weight > 0
           AND vndbid_type(i.id) = 'sf'
           AND i.c_sexual_avg < 40
           AND i.c_violence_avg < 40
         ORDER BY random()
         LIMIT 4) i(id)
  JOIN vn_screenshots vs ON vs.scr = i.id
  JOIN vnt('(en,,,,,,,,f,f,f,f,t,f,f,f,f,f,,,,,f,f,f,f)'::langprefs) v ON v.id = vs.id
 WHERE NOT v.hidden
 ORDER BY random() LIMIT 4;

A fairly staightforward query. Using the vnt view it runs in about 5ms, with the wonderfully optimized vnt() function above, 37ms. The planner completely misses an important optimization. While I’m sure I could find other workarounds for that particular query, I found several similar cases in just a short time. Apart from the work involved in revisiting all queries used by the site, the workarounds necessary to re-optimize them again are going to be fairly ugly.

Increasing join_collapse_limit and from_collapse_limit appears to work, but I have queries that may have 3 or 4 vnt() calls and optimizing those requires bumping the limit by a factor 4 or 5. I’m worried about the impact of doing that site-wide, will have to do some more experiments for that.

This little experiment may not have been the biggest success, but there’s still promise.

(Either way I got a full script ⓘ for this one, too)

The Cliffhanger

And that’s when I decided that I wasn’t going to make much progress and I might as well spend some time writing an article about it. There’s still a bunch of other ideas worth exploring:

I could keep vnt() around for the few special cases where I need title selection to be done entirely in SQL, while falling back to application-generated subqueries for the more common case that I don’t. Downside: I need to maintain multiple different implementations of the same thing.
The simpler option: get rid of the vn_titles table and instead denormalize the list of titles to a single column in the vn table. That should greatly simplify implementing title selection and has the potential to be faster than the current vnt view approach. Downside: I don’t like it.

2023-01-31 update: I ran some experiments and it turns out that this approach is not actually faster than the vnt view.
There’s always the madness option of trying SupportRequestSimplify.

If you have any better ideas, I’d love to hear them as well.

The Conclusion

I wonder how other sites deal with features like this, other than “we don’t” and “we do but you can only select a single language option”. Perhaps I just want too much.

And it’s always fun when people request simple-sounding features (“Hey can you implement language preferences? Super simple! kthnx”) and you end up in a wonderfully deep rabbit hole. I haven’t even fully explored this particular hole yet.

For now, though, I’ll be off exploring other ideas again. I’d like to end with “To be continued…”, but I’m lazy when it comes to writing so there may not be a followup.

Sadly, while the “CREATE TABLE” syntax supports an “ON COMMIT DROP” clause to get rid of the table after the transaction has completed, this feature is not available for views, so the view needs to be dropped or replaced manually to switch back to the default configuration.↩︎
The planner made a bad estimate about the number of rows returned by the “id < 'v10'” filter and ended up choosing a sequential scan on vn_titles when a few index lookups would probably have been faster, but that’s not important for now.↩︎