Manticore supports SELECT subqueries via SQL in the following format:
SELECT * FROM (SELECT ... ORDER BY cond1 LIMIT X) ORDER BY cond2 LIMIT Y
The outer select allows only ORDER BY
and LIMIT
clauses. Sub-select queries currently have two use cases:
When you have a query with two ranking UDFs, one very fast and the other slow, and perform a full-text search with a large match result set. Without subselect, the query would look like:
SELECT id,slow_rank() as slow,fast_rank() as fast FROM index
WHERE MATCH(‘some common query terms’) ORDER BY fast DESC, slow DESC LIMIT 20
OPTION max_matches=1000;
With sub-selects, the query can be rewritten as:
SELECT * FROM
SELECT id,slow_rank() as slow,fast_rank() as fast FROM index WHERE
(query terms’)
MATCH(‘some common ORDER BY fast DESC LIMIT 100 OPTION max_matches=1000)
ORDER BY slow DESC LIMIT 20;
In the initial query, the slow_rank()
UDF is computed for the entire match result set. With SELECT sub-queries, only fast_rank()
is computed for the entire match result set, while slow_rank()
is computed for a limited set.
The second case is useful for large result sets coming from a distributed table.
For this query:
SELECT * FROM my_dist_index WHERE some_conditions LIMIT 50000;
If you have 20 nodes, each node can send back to the master a maximum of 50K records, resulting in 20 x 50K = 1M records. However, since the master sends back only 50K (out of 1M), it might be good enough for the nodes to send only the top 10K records. With sub-select, you can rewrite the query as:
SELECT * FROM
SELECT * FROM my_dist_index WHERE some_conditions LIMIT 10000)
(ORDER by some_attr LIMIT 50000;
In this case, the nodes receive only the inner query and execute it. This means the master will receive only 20x10K=200K records. The master will take all the records received, reorder them by the OUTER clause, and return the best 50K records. The sub-select helps reduce the traffic between the master and the nodes, as well as reduce the master’s computation time (since it processes only 200K instead of 1M records).