================================================================================================
Benchmark for performance of JSON parsing
================================================================================================

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
JSON schema inferring:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                       68879          68993         116          1.5         688.8       1.0X
UTF-8 is set                                     115270         115602         455          0.9        1152.7       0.6X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
count a short column:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                       47452          47538         113          2.1         474.5       1.0X
UTF-8 is set                                      77330          77354          30          1.3         773.3       0.6X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
count a wide column:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                       60470          60900         534          0.2        6047.0       1.0X
UTF-8 is set                                     104733         104931         189          0.1       10473.3       0.6X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
select wide row:                          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                      130302         131072         976          0.0      260604.6       1.0X
UTF-8 is set                                     150860         151284         377          0.0      301720.1       0.9X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Select a subset of 10 columns:            Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns                                 18619          18684          99          0.5        1861.9       1.0X
Select 1 column                                   24227          24270          38          0.4        2422.7       0.8X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
creation of JSON parser per line:         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding                      7947           7971          21          1.3         794.7       1.0X
Short column with UTF-8                           12700          12753          58          0.8        1270.0       0.6X
Wide column without encoding                      92632          92955         463          0.1        9263.2       0.1X
Wide column with UTF-8                           147013         147170         188          0.1       14701.3       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
JSON functions:                           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           713            734          19         14.0          71.3       1.0X
from_json                                         22019          22429         456          0.5        2201.9       0.0X
json_tuple                                        27987          28047          74          0.4        2798.7       0.0X
get_json_object                                   21468          21870         350          0.5        2146.8       0.0X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Dataset of json strings:                  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                          2887           2910          24         17.3          57.7       1.0X
schema inferring                                  31793          31843          43          1.6         635.9       0.1X
parsing                                           36791          37104         294          1.4         735.8       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Json files in the per-line mode:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                         10570          10611          45          4.7         211.4       1.0X
Schema inferring                                  48729          48763          41          1.0         974.6       0.2X
Parsing without charset                           35490          35648         141          1.4         709.8       0.3X
Parsing with UTF-8                                63853          63994         163          0.8        1277.1       0.2X

OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Write dates and timestamps:               Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps                     2187           2190           5          4.6         218.7       1.0X
to_json(timestamp)                                16262          16503         323          0.6        1626.2       0.1X
write timestamps to files                         11679          11692          12          0.9        1167.9       0.2X
Create a dataset of dates                          2297           2310          12          4.4         229.7       1.0X
to_json(date)                                     10904          10956          46          0.9        1090.4       0.2X
write dates to files                               6610           6645          35          1.5         661.0       0.3X

OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Read dates and timestamps:                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
read timestamp text from files                     2524           2530           9          4.0         252.4       1.0X
read timestamps from files                        41002          41052          59          0.2        4100.2       0.1X
infer timestamps from files                       84621          84939         526          0.1        8462.1       0.0X
read date text from files                          2292           2302           9          4.4         229.2       1.1X
read date from files                              16954          16976          21          0.6        1695.4       0.1X
timestamp strings                                  3067           3077          13          3.3         306.7       0.8X
parse timestamps from Dataset[String]             48690          48971         243          0.2        4869.0       0.1X
infer timestamps from Dataset[String]             97463          97786         338          0.1        9746.3       0.0X
date strings                                       3952           3956           3          2.5         395.2       0.6X
parse dates from Dataset[String]                  24210          24241          30          0.4        2421.0       0.1X
from_json(timestamp)                              71710          72242         629          0.1        7171.0       0.0X
from_json(date)                                   42465          42481          13          0.2        4246.5       0.1X