0001 ================================================================================================
0002 Benchmark for performance of JSON parsing
0003 ================================================================================================
0004
0005 Preparing data for benchmarking ...
0006 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0007 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0008 JSON schema inferring: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0009 ------------------------------------------------------------------------------------------------------------------------
0010 No encoding 63981 64044 56 1.6 639.8 1.0X
0011 UTF-8 is set 112672 113350 962 0.9 1126.7 0.6X
0012
0013 Preparing data for benchmarking ...
0014 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0015 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0016 count a short column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0017 ------------------------------------------------------------------------------------------------------------------------
0018 No encoding 51256 51449 180 2.0 512.6 1.0X
0019 UTF-8 is set 83694 83859 148 1.2 836.9 0.6X
0020
0021 Preparing data for benchmarking ...
0022 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0023 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0024 count a wide column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0025 ------------------------------------------------------------------------------------------------------------------------
0026 No encoding 58440 59097 569 0.2 5844.0 1.0X
0027 UTF-8 is set 102746 102883 198 0.1 10274.6 0.6X
0028
0029 Preparing data for benchmarking ...
0030 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0031 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0032 select wide row: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0033 ------------------------------------------------------------------------------------------------------------------------
0034 No encoding 128982 129304 356 0.0 257965.0 1.0X
0035 UTF-8 is set 147247 147415 231 0.0 294494.1 0.9X
0036
0037 Preparing data for benchmarking ...
0038 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0039 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0040 Select a subset of 10 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0041 ------------------------------------------------------------------------------------------------------------------------
0042 Select 10 columns 18837 19048 331 0.5 1883.7 1.0X
0043 Select 1 column 24707 24723 14 0.4 2470.7 0.8X
0044
0045 Preparing data for benchmarking ...
0046 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0047 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0048 creation of JSON parser per line: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0049 ------------------------------------------------------------------------------------------------------------------------
0050 Short column without encoding 8218 8234 17 1.2 821.8 1.0X
0051 Short column with UTF-8 12374 12438 107 0.8 1237.4 0.7X
0052 Wide column without encoding 136918 137298 345 0.1 13691.8 0.1X
0053 Wide column with UTF-8 176961 177142 257 0.1 17696.1 0.0X
0054
0055 Preparing data for benchmarking ...
0056 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0057 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0058 JSON functions: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0059 ------------------------------------------------------------------------------------------------------------------------
0060 Text read 1268 1278 12 7.9 126.8 1.0X
0061 from_json 23348 23479 176 0.4 2334.8 0.1X
0062 json_tuple 29606 30221 1024 0.3 2960.6 0.0X
0063 get_json_object 21898 22148 226 0.5 2189.8 0.1X
0064
0065 Preparing data for benchmarking ...
0066 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0067 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0068 Dataset of json strings: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0069 ------------------------------------------------------------------------------------------------------------------------
0070 Text read 5887 5944 49 8.5 117.7 1.0X
0071 schema inferring 46696 47054 312 1.1 933.9 0.1X
0072 parsing 32336 32450 129 1.5 646.7 0.2X
0073
0074 Preparing data for benchmarking ...
0075 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0076 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0077 Json files in the per-line mode: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0078 ------------------------------------------------------------------------------------------------------------------------
0079 Text read 9756 9769 11 5.1 195.1 1.0X
0080 Schema inferring 51318 51433 108 1.0 1026.4 0.2X
0081 Parsing without charset 43609 43743 118 1.1 872.2 0.2X
0082 Parsing with UTF-8 60775 60844 106 0.8 1215.5 0.2X
0083
0084 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0085 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0086 Write dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0087 ------------------------------------------------------------------------------------------------------------------------
0088 Create a dataset of timestamps 1998 2015 17 5.0 199.8 1.0X
0089 to_json(timestamp) 18156 18317 263 0.6 1815.6 0.1X
0090 write timestamps to files 12912 12917 5 0.8 1291.2 0.2X
0091 Create a dataset of dates 2209 2270 53 4.5 220.9 0.9X
0092 to_json(date) 9433 9489 90 1.1 943.3 0.2X
0093 write dates to files 6915 6923 8 1.4 691.5 0.3X
0094
0095 OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
0096 Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
0097 Read dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
0098 ------------------------------------------------------------------------------------------------------------------------
0099 read timestamp text from files 2395 2412 17 4.2 239.5 1.0X
0100 read timestamps from files 47269 47334 89 0.2 4726.9 0.1X
0101 infer timestamps from files 91806 91851 67 0.1 9180.6 0.0X
0102 read date text from files 2118 2133 13 4.7 211.8 1.1X
0103 read date from files 17267 17340 115 0.6 1726.7 0.1X
0104 timestamp strings 3906 3935 26 2.6 390.6 0.6X
0105 parse timestamps from Dataset[String] 52244 52534 279 0.2 5224.4 0.0X
0106 infer timestamps from Dataset[String] 100488 100714 198 0.1 10048.8 0.0X
0107 date strings 4572 4584 12 2.2 457.2 0.5X
0108 parse dates from Dataset[String] 26749 26768 17 0.4 2674.9 0.1X
0109 from_json(timestamp) 71414 71867 556 0.1 7141.4 0.0X
0110 from_json(date) 45322 45549 250 0.2 4532.2 0.1X
0111
0112