Apply Attributes
This page provides details on how to apply attributes in PyKX.
Tip
For the best experience, start by reading about PyKX attributes.
In PyKX, you can apply attributes to various data structures, including Vector
/List
types, Tables
, and KeyedTable
s. To apply the attributes, call the sorted
, unique
, grouped
, and parted
methods on these objects.
Sorted
The sorted
attribute ensures that all items in the Vector
/ Table
column are sorted in ascending order. This attribute will be removed if you append to the list with an item that is not in sorted order.
Example of applying the sorted attribute to a Vector by calling the sorted method on the Vector:
Python
>>> a = kx.q.til(10)
>>> a
pykx.LongVector(pykx.q('0 1 2 3 4 5 6 7 8 9'))
>>> a.sorted()
pykx.LongVector(pykx.q('`s#0 1 2 3 4 5 6 7 8 9'))
Unique
The unique
attribute ensures that all items in the Vector
/ Table
column are unique (there are no duplicated values). This attribute will be removed if you append to the list with an item that is not unique.
Example of applying the unique attribute to the first column of the table:
Python
>>> a = kx.Table(data = {
... 'a': kx.q.til(5),
... 'b': ['a', 'b', 'c', 'd', 'e']
... })
>>> kx.q.meta(a)
pykx.KeyedTable(pykx.q('
c| t f a
-| -----
a| j
b| s
'))
>>> a = a.unique()
>>> kx.q.meta(a)
pykx.KeyedTable(pykx.q('
c| t f a
-| -----
a| j u
b| s
'))
Grouped
The grouped
attribute ensures that all items in the Vector
/ Table
column are stored in adifferent format to help reduce memory usage. It creates a backing dictionary to store the value and indexes that each value has within the list.
Unlike other attributes, the grouped attribute will be kept on all insert operations to the list. For instance, this is how a grouped list would be stored:
Python
// The list
`g#`a`b`c`a`b`b`c
// The backing dictionary
a| 0 3
b| 1 4 5
c| 2 6
Example of applying the grouped attribute to a specified column of a table:
Python
>>> a = kx.Table(data = {
... 'a': kx.q.til(5),
... 'b': ['a', 'a', 'b', 'b', 'b']
... })
>>> kx.q.meta(a)
pykx.KeyedTable(pykx.q('
c| t f a
-| -----
a| j
b| s
'))
>>> a = a.grouped('b')
>>> kx.q.meta(a)
pykx.KeyedTable(pykx.q('
c| t f a
-| -----
a| j
b| s g
'))
Parted
The parted
attribute is similar to the grouped attribute with the additional requirement that each unique value must be adjacent to its other copies, where the grouped attribute allows them to be dispersed throughout the Vector
/ Table
.
When possible, the parted attribute results in a larger performance gain than using the grouped attribute. This attribute will be removed if you append to the list with an item that is not in the parted order.
Python
// Can be parted
`p#`a`a`a`e`e`b`b`c`c`c`d
// Has to be grouped as the `d symbols are not all contiguous within the vector
`g#`a`a`d`e`e`b`b`c`c`c`d
Example of applying the parted attribute to multiple columns on a table:
Python
>>> a = kx.Table(data = {
... 'a': kx.q.til(5),
... 'b': ['a', 'a', 'b', 'b', 'b']
... })
>>> kx.q.meta(a)
pykx.KeyedTable(pykx.q('
c| t f a
-| -----
a| j
b| s
'))
>>> a = a.parted(['a', 'b'])
>>> kx.q.meta(a)
pykx.KeyedTable(pykx.q('
c| t f a
-| -----
a| j p
b| s p
'))
Performance
When attributes are set on PyKX objects, various functions can use these attributes to speed up their execution, by using different algorithms. For example, searching through a list without an attribute requires checking every single value. However, setting the sorted attribute allows a search algorithm to use a binary search instead and then only a fraction of the values actually needs to be checked.
Examples of functions that can use attributes to speed up execution:
-
Where clauses in
select
andexec
templates run faster withwhere =
,where in
andwhere within
.
Note
Setting attributes consumes resources and is likely to improve performance on large lists.