2024-11-23 16:20:42,240 - Epoch 0, Loss: 0.4148, Throughput: 140.23 samples/s 2024-11-23 16:20:48,127 - Epoch 1, Loss: 0.2173, Throughput: 193.91 samples/s 2024-11-23 16:20:53,857 - Epoch 2, Loss: 0.1491, Throughput: 194.12 samples/s 2024-11-23 16:20:59,494 - Epoch 3, Loss: 0.1180, Throughput: 194.32 samples/s 2024-11-23 16:21:05,001 - Epoch 4, Loss: 0.0992, Throughput: 191.94 samples/s 2024-11-23 16:21:10,690 - Epoch 5, Loss: 0.0895, Throughput: 192.20 samples/s 2024-11-23 16:21:16,278 - Epoch 6, Loss: 0.0817, Throughput: 193.87 samples/s 2024-11-23 16:21:21,789 - Epoch 7, Loss: 0.0763, Throughput: 194.27 samples/s 2024-11-23 16:21:27,306 - Epoch 8, Loss: 0.0750, Throughput: 193.02 samples/s 2024-11-23 16:21:32,909 - Epoch 9, Loss: 0.0688, Throughput: 193.54 samples/s 2024-11-23 16:21:38,495 - Epoch 10, Loss: 0.0635, Throughput: 193.87 samples/s 2024-11-23 16:21:44,002 - Epoch 11, Loss: 0.0591, Throughput: 193.99 samples/s 2024-11-23 16:21:49,427 - Epoch 12, Loss: 0.0574, Throughput: 194.88 samples/s 2024-11-23 16:21:54,855 - Epoch 13, Loss: 0.0517, Throughput: 193.26 samples/s 2024-11-23 16:22:00,345 - Epoch 14, Loss: 0.0577, Throughput: 193.23 samples/s 2024-11-23 16:22:05,789 - Epoch 15, Loss: 0.0526, Throughput: 194.32 samples/s 2024-11-23 16:22:11,419 - Epoch 16, Loss: 0.0463, Throughput: 193.18 samples/s 2024-11-23 16:22:17,302 - Epoch 17, Loss: 0.0508, Throughput: 191.12 samples/s 2024-11-23 16:22:22,768 - Epoch 18, Loss: 0.0472, Throughput: 192.89 samples/s 2024-11-23 16:22:28,368 - Epoch 19, Loss: 0.0482, Throughput: 193.79 samples/s 2024-11-23 16:22:33,918 - Epoch 20, Loss: 0.0462, Throughput: 193.45 samples/s 2024-11-23 16:22:39,693 - Epoch 21, Loss: 0.0419, Throughput: 194.47 samples/s 2024-11-23 16:22:45,408 - Epoch 22, Loss: 0.0484, Throughput: 192.99 samples/s 2024-11-23 16:22:50,911 - Epoch 23, Loss: 0.0383, Throughput: 192.91 samples/s 2024-11-23 16:22:56,453 - Epoch 24, Loss: 0.0384, Throughput: 195.25 samples/s 2024-11-23 16:23:02,010 - Epoch 25, Loss: 0.0369, Throughput: 193.38 samples/s 2024-11-23 16:23:07,687 - Epoch 26, Loss: 0.0388, Throughput: 193.53 samples/s 2024-11-23 16:23:13,300 - Epoch 27, Loss: 0.0357, Throughput: 194.07 samples/s 2024-11-23 16:23:18,782 - Epoch 28, Loss: 0.0353, Throughput: 193.43 samples/s 2024-11-23 16:23:24,326 - Epoch 29, Loss: 0.0343, Throughput: 193.61 samples/s 2024-11-23 16:23:29,895 - Epoch 30, Loss: 0.0376, Throughput: 191.33 samples/s 2024-11-23 16:23:35,501 - Epoch 31, Loss: 0.0346, Throughput: 193.02 samples/s 2024-11-23 16:23:41,641 - Epoch 32, Loss: 0.0341, Throughput: 193.21 samples/s 2024-11-23 16:23:47,865 - Epoch 33, Loss: 0.0337, Throughput: 194.62 samples/s 2024-11-23 16:23:54,096 - Epoch 34, Loss: 0.0330, Throughput: 193.06 samples/s 2024-11-23 16:24:00,361 - Epoch 35, Loss: 0.0358, Throughput: 192.89 samples/s 2024-11-23 16:24:06,582 - Epoch 36, Loss: 0.0354, Throughput: 193.04 samples/s 2024-11-23 16:24:12,861 - Epoch 37, Loss: 0.0330, Throughput: 192.99 samples/s 2024-11-23 16:24:19,262 - Epoch 38, Loss: 0.0307, Throughput: 193.63 samples/s 2024-11-23 16:24:25,720 - Epoch 39, Loss: 0.0312, Throughput: 192.75 samples/s 2024-11-23 16:24:32,184 - Epoch 40, Loss: 0.0302, Throughput: 194.05 samples/s 2024-11-23 16:24:38,731 - Epoch 41, Loss: 0.0304, Throughput: 193.81 samples/s 2024-11-23 16:24:45,066 - Epoch 42, Loss: 0.0328, Throughput: 193.17 samples/s 2024-11-23 16:24:51,402 - Epoch 43, Loss: 0.0308, Throughput: 192.77 samples/s 2024-11-23 16:24:57,852 - Epoch 44, Loss: 0.0311, Throughput: 191.84 samples/s 2024-11-23 16:25:04,448 - Epoch 45, Loss: 0.0296, Throughput: 192.21 samples/s 2024-11-23 16:25:10,944 - Epoch 46, Loss: 0.0300, Throughput: 193.34 samples/s 2024-11-23 16:25:17,298 - Epoch 47, Loss: 0.0296, Throughput: 193.35 samples/s 2024-11-23 16:25:23,876 - Epoch 48, Loss: 0.0284, Throughput: 192.58 samples/s 2024-11-23 16:25:30,385 - Epoch 49, Loss: 0.0290, Throughput: 193.30 samples/s 2024-11-23 16:25:37,259 - Epoch 50, Loss: 0.0301, Throughput: 193.95 samples/s 2024-11-23 16:25:43,600 - Epoch 51, Loss: 0.0290, Throughput: 193.47 samples/s 2024-11-23 16:25:51,015 - Epoch 52, Loss: 0.0292, Throughput: 193.50 samples/s 2024-11-23 16:25:58,285 - Epoch 53, Loss: 0.0284, Throughput: 193.52 samples/s 2024-11-23 16:26:05,062 - Epoch 54, Loss: 0.0281, Throughput: 193.20 samples/s 2024-11-23 16:26:12,106 - Epoch 55, Loss: 0.0280, Throughput: 192.73 samples/s 2024-11-23 16:26:19,212 - Epoch 56, Loss: 0.0273, Throughput: 192.01 samples/s 2024-11-23 16:26:26,142 - Epoch 57, Loss: 0.0276, Throughput: 191.79 samples/s 2024-11-23 16:26:33,139 - Epoch 58, Loss: 0.0271, Throughput: 193.29 samples/s 2024-11-23 16:26:40,044 - Epoch 59, Loss: 0.0283, Throughput: 193.29 samples/s 2024-11-23 16:26:46,947 - Epoch 60, Loss: 0.0275, Throughput: 193.61 samples/s 2024-11-23 16:26:54,026 - Epoch 61, Loss: 0.0278, Throughput: 192.97 samples/s 2024-11-23 16:27:00,777 - Epoch 62, Loss: 0.0272, Throughput: 192.79 samples/s 2024-11-23 16:27:07,926 - Epoch 63, Loss: 0.0270, Throughput: 192.91 samples/s 2024-11-23 16:27:14,736 - Epoch 64, Loss: 0.0276, Throughput: 194.16 samples/s 2024-11-23 16:27:21,765 - Epoch 65, Loss: 0.0268, Throughput: 193.42 samples/s 2024-11-23 16:27:28,801 - Epoch 66, Loss: 0.0260, Throughput: 193.04 samples/s 2024-11-23 16:27:36,154 - Epoch 67, Loss: 0.0265, Throughput: 193.63 samples/s 2024-11-23 16:27:43,513 - Epoch 68, Loss: 0.0259, Throughput: 192.52 samples/s 2024-11-23 16:27:50,636 - Epoch 69, Loss: 0.0267, Throughput: 191.37 samples/s 2024-11-23 16:27:57,243 - Epoch 70, Loss: 0.0258, Throughput: 193.67 samples/s 2024-11-23 16:28:04,184 - Epoch 71, Loss: 0.0254, Throughput: 194.79 samples/s 2024-11-23 16:28:11,243 - Epoch 72, Loss: 0.0258, Throughput: 192.77 samples/s 2024-11-23 16:28:18,459 - Epoch 73, Loss: 0.0265, Throughput: 192.94 samples/s 2024-11-23 16:28:26,129 - Epoch 74, Loss: 0.0253, Throughput: 193.04 samples/s 2024-11-23 16:28:33,475 - Epoch 75, Loss: 0.0262, Throughput: 193.65 samples/s 2024-11-23 16:28:41,543 - Epoch 76, Loss: 0.0259, Throughput: 193.15 samples/s 2024-11-23 16:28:49,962 - Epoch 77, Loss: 0.0257, Throughput: 194.23 samples/s 2024-11-23 16:28:57,061 - Epoch 78, Loss: 0.0258, Throughput: 193.42 samples/s 2024-11-23 16:29:04,572 - Epoch 79, Loss: 0.0256, Throughput: 194.00 samples/s 2024-11-23 16:29:12,263 - Epoch 80, Loss: 0.0246, Throughput: 193.31 samples/s 2024-11-23 16:29:19,662 - Epoch 81, Loss: 0.0252, Throughput: 192.85 samples/s 2024-11-23 16:29:27,171 - Epoch 82, Loss: 0.0248, Throughput: 192.35 samples/s 2024-11-23 16:29:34,917 - Epoch 83, Loss: 0.0245, Throughput: 192.21 samples/s 2024-11-23 16:29:42,395 - Epoch 84, Loss: 0.0250, Throughput: 192.84 samples/s 2024-11-23 16:29:49,449 - Epoch 85, Loss: 0.0264, Throughput: 192.25 samples/s 2024-11-23 16:29:56,828 - Epoch 86, Loss: 0.0251, Throughput: 194.46 samples/s 2024-11-23 16:30:03,465 - Epoch 87, Loss: 0.0245, Throughput: 192.69 samples/s 2024-11-23 16:30:10,755 - Epoch 88, Loss: 0.0247, Throughput: 193.29 samples/s 2024-11-23 16:30:17,968 - Epoch 89, Loss: 0.0253, Throughput: 192.63 samples/s 2024-11-23 16:30:25,268 - Epoch 90, Loss: 0.0248, Throughput: 193.06 samples/s 2024-11-23 16:30:32,534 - Epoch 91, Loss: 0.0244, Throughput: 194.27 samples/s 2024-11-23 16:30:39,675 - Epoch 92, Loss: 0.0246, Throughput: 192.40 samples/s 2024-11-23 16:30:47,048 - Epoch 93, Loss: 0.0257, Throughput: 194.44 samples/s 2024-11-23 16:30:54,344 - Epoch 94, Loss: 0.0244, Throughput: 193.74 samples/s 2024-11-23 16:31:01,287 - Epoch 95, Loss: 0.0244, Throughput: 192.23 samples/s 2024-11-23 16:31:08,447 - Epoch 96, Loss: 0.0240, Throughput: 192.48 samples/s 2024-11-23 16:31:15,484 - Epoch 97, Loss: 0.0239, Throughput: 194.09 samples/s 2024-11-23 16:31:22,841 - Epoch 98, Loss: 0.0240, Throughput: 194.18 samples/s 2024-11-23 16:31:30,000 - Epoch 99, Loss: 0.0265, Throughput: 193.42 samples/s 2024-11-23 16:31:37,888 - Epoch 100, Loss: 0.0239, Throughput: 192.91 samples/s 2024-11-23 16:31:45,246 - Epoch 101, Loss: 0.0247, Throughput: 193.47 samples/s 2024-11-23 16:31:52,591 - Epoch 102, Loss: 0.0237, Throughput: 192.39 samples/s 2024-11-23 16:31:59,484 - Epoch 103, Loss: 0.0239, Throughput: 192.77 samples/s 2024-11-23 16:32:06,628 - Epoch 104, Loss: 0.0235, Throughput: 192.84 samples/s 2024-11-23 16:32:13,978 - Epoch 105, Loss: 0.0235, Throughput: 192.73 samples/s 2024-11-23 16:32:20,681 - Epoch 106, Loss: 0.0230, Throughput: 193.95 samples/s 2024-11-23 16:32:27,464 - Epoch 107, Loss: 0.0234, Throughput: 190.78 samples/s 2024-11-23 16:32:34,683 - Epoch 108, Loss: 0.0231, Throughput: 192.09 samples/s 2024-11-23 16:32:41,401 - Epoch 109, Loss: 0.0229, Throughput: 193.26 samples/s 2024-11-23 16:32:59,351 - Epoch 110, Loss: 0.0231, Throughput: 192.33 samples/s 2024-11-23 16:33:06,120 - Epoch 111, Loss: 0.0231, Throughput: 192.90 samples/s 2024-11-23 16:33:12,662 - Epoch 112, Loss: 0.0229, Throughput: 192.95 samples/s 2024-11-23 16:33:19,582 - Epoch 113, Loss: 0.0232, Throughput: 194.15 samples/s 2024-11-23 16:33:26,018 - Epoch 114, Loss: 0.0236, Throughput: 193.22 samples/s 2024-11-23 16:33:32,424 - Epoch 115, Loss: 0.0229, Throughput: 193.39 samples/s 2024-11-23 16:33:40,252 - Epoch 116, Loss: 0.0229, Throughput: 193.43 samples/s 2024-11-23 16:33:46,897 - Epoch 117, Loss: 0.0224, Throughput: 193.94 samples/s 2024-11-23 16:33:53,551 - Epoch 118, Loss: 0.0226, Throughput: 193.49 samples/s 2024-11-23 16:33:59,952 - Epoch 119, Loss: 0.0229, Throughput: 193.27 samples/s 2024-11-23 16:34:06,933 - Epoch 120, Loss: 0.0225, Throughput: 192.65 samples/s 2024-11-23 16:34:14,255 - Epoch 121, Loss: 0.0226, Throughput: 192.75 samples/s 2024-11-23 16:34:21,120 - Epoch 122, Loss: 0.0227, Throughput: 193.03 samples/s 2024-11-23 16:34:28,440 - Epoch 123, Loss: 0.0230, Throughput: 192.79 samples/s 2024-11-23 16:34:35,481 - Epoch 124, Loss: 0.0224, Throughput: 192.40 samples/s 2024-11-23 16:34:42,209 - Epoch 125, Loss: 0.0225, Throughput: 192.68 samples/s 2024-11-23 16:34:49,745 - Epoch 126, Loss: 0.0229, Throughput: 192.75 samples/s 2024-11-23 16:34:56,527 - Epoch 127, Loss: 0.0225, Throughput: 192.87 samples/s 2024-11-23 16:35:03,279 - Epoch 128, Loss: 0.0223, Throughput: 193.28 samples/s 2024-11-23 16:35:10,324 - Epoch 129, Loss: 0.0226, Throughput: 192.75 samples/s 2024-11-23 16:35:17,581 - Epoch 130, Loss: 0.0223, Throughput: 192.86 samples/s 2024-11-23 16:35:24,501 - Epoch 131, Loss: 0.0222, Throughput: 192.87 samples/s 2024-11-23 16:35:31,594 - Epoch 132, Loss: 0.0223, Throughput: 193.22 samples/s 2024-11-23 16:35:38,886 - Epoch 133, Loss: 0.0219, Throughput: 192.68 samples/s 2024-11-23 16:35:45,701 - Epoch 134, Loss: 0.0231, Throughput: 191.80 samples/s 2024-11-23 16:35:51,916 - Epoch 135, Loss: 0.0225, Throughput: 193.53 samples/s 2024-11-23 16:35:58,100 - Epoch 136, Loss: 0.0221, Throughput: 192.67 samples/s 2024-11-23 16:36:04,607 - Epoch 137, Loss: 0.0223, Throughput: 192.65 samples/s 2024-11-23 16:36:11,448 - Epoch 138, Loss: 0.0223, Throughput: 191.97 samples/s 2024-11-23 16:36:19,435 - Epoch 139, Loss: 0.0220, Throughput: 193.04 samples/s 2024-11-23 16:36:27,423 - Epoch 140, Loss: 0.0218, Throughput: 193.51 samples/s 2024-11-23 16:36:35,549 - Epoch 141, Loss: 0.0218, Throughput: 193.60 samples/s 2024-11-23 16:36:42,921 - Epoch 142, Loss: 0.0221, Throughput: 194.00 samples/s 2024-11-23 16:36:50,384 - Epoch 143, Loss: 0.0218, Throughput: 193.93 samples/s 2024-11-23 16:36:57,303 - Epoch 144, Loss: 0.0218, Throughput: 193.03 samples/s 2024-11-23 16:37:04,385 - Epoch 145, Loss: 0.0214, Throughput: 193.17 samples/s 2024-11-23 16:37:12,042 - Epoch 146, Loss: 0.0214, Throughput: 192.58 samples/s 2024-11-23 16:37:18,775 - Epoch 147, Loss: 0.0213, Throughput: 192.90 samples/s 2024-11-23 16:37:25,643 - Epoch 148, Loss: 0.0218, Throughput: 192.96 samples/s 2024-11-23 16:37:32,610 - Epoch 149, Loss: 0.0216, Throughput: 192.52 samples/s 2024-11-23 16:37:39,500 - Epoch 150, Loss: 0.0220, Throughput: 192.69 samples/s 2024-11-23 16:37:46,565 - Epoch 151, Loss: 0.0214, Throughput: 192.49 samples/s 2024-11-23 16:37:54,785 - Epoch 152, Loss: 0.0214, Throughput: 192.76 samples/s 2024-11-23 16:38:02,084 - Epoch 153, Loss: 0.0215, Throughput: 193.61 samples/s 2024-11-23 16:38:09,095 - Epoch 154, Loss: 0.0212, Throughput: 192.90 samples/s 2024-11-23 16:38:16,360 - Epoch 155, Loss: 0.0212, Throughput: 193.23 samples/s 2024-11-23 16:38:23,627 - Epoch 156, Loss: 0.0211, Throughput: 192.77 samples/s 2024-11-23 16:38:30,112 - Epoch 157, Loss: 0.0216, Throughput: 192.67 samples/s 2024-11-23 16:38:36,641 - Epoch 158, Loss: 0.0208, Throughput: 193.94 samples/s 2024-11-23 16:38:45,221 - Epoch 159, Loss: 0.0211, Throughput: 192.63 samples/s 2024-11-23 16:38:52,832 - Epoch 160, Loss: 0.0212, Throughput: 193.47 samples/s 2024-11-23 16:38:59,920 - Epoch 161, Loss: 0.0206, Throughput: 195.10 samples/s 2024-11-23 16:39:06,686 - Epoch 162, Loss: 0.0208, Throughput: 193.63 samples/s 2024-11-23 16:39:13,560 - Epoch 163, Loss: 0.0212, Throughput: 194.16 samples/s 2024-11-23 16:39:20,444 - Epoch 164, Loss: 0.0207, Throughput: 193.04 samples/s 2024-11-23 16:39:27,078 - Epoch 165, Loss: 0.0212, Throughput: 193.67 samples/s 2024-11-23 16:39:34,051 - Epoch 166, Loss: 0.0208, Throughput: 194.04 samples/s 2024-11-23 16:39:41,180 - Epoch 167, Loss: 0.0208, Throughput: 192.85 samples/s 2024-11-23 16:39:48,679 - Epoch 168, Loss: 0.0229, Throughput: 194.20 samples/s 2024-11-23 16:39:55,408 - Epoch 169, Loss: 0.0208, Throughput: 192.86 samples/s 2024-11-23 16:40:02,179 - Epoch 170, Loss: 0.0246, Throughput: 193.36 samples/s 2024-11-23 16:40:09,263 - Epoch 171, Loss: 0.0215, Throughput: 193.70 samples/s 2024-11-23 16:40:15,720 - Epoch 172, Loss: 0.0211, Throughput: 191.08 samples/s 2024-11-23 16:40:22,104 - Epoch 173, Loss: 0.0210, Throughput: 194.80 samples/s 2024-11-23 16:40:28,427 - Epoch 174, Loss: 0.0211, Throughput: 193.34 samples/s 2024-11-23 16:40:35,529 - Epoch 175, Loss: 0.0204, Throughput: 192.82 samples/s 2024-11-23 16:40:42,043 - Epoch 176, Loss: 0.0285, Throughput: 193.51 samples/s 2024-11-23 16:40:48,358 - Epoch 177, Loss: 0.0211, Throughput: 194.52 samples/s 2024-11-23 16:40:54,800 - Epoch 178, Loss: 0.0214, Throughput: 194.46 samples/s 2024-11-23 16:41:01,075 - Epoch 179, Loss: 0.0211, Throughput: 194.59 samples/s 2024-11-23 16:41:08,125 - Epoch 180, Loss: 0.0216, Throughput: 192.77 samples/s 2024-11-23 16:41:14,380 - Epoch 181, Loss: 0.0206, Throughput: 194.46 samples/s 2024-11-23 16:41:20,879 - Epoch 182, Loss: 0.0203, Throughput: 193.95 samples/s 2024-11-23 16:41:27,923 - Epoch 183, Loss: 0.0203, Throughput: 193.25 samples/s 2024-11-23 16:41:34,292 - Epoch 184, Loss: 0.0204, Throughput: 191.85 samples/s 2024-11-23 16:41:40,859 - Epoch 185, Loss: 0.0204, Throughput: 191.75 samples/s 2024-11-23 16:41:47,262 - Epoch 186, Loss: 0.0205, Throughput: 192.77 samples/s 2024-11-23 16:41:53,794 - Epoch 187, Loss: 0.0202, Throughput: 193.32 samples/s 2024-11-23 16:42:00,232 - Epoch 188, Loss: 0.0201, Throughput: 193.58 samples/s 2024-11-23 16:42:06,594 - Epoch 189, Loss: 0.0202, Throughput: 192.90 samples/s 2024-11-23 16:42:12,900 - Epoch 190, Loss: 0.0202, Throughput: 193.01 samples/s 2024-11-23 16:42:19,014 - Epoch 191, Loss: 0.0201, Throughput: 193.65 samples/s 2024-11-23 16:42:25,461 - Epoch 192, Loss: 0.0204, Throughput: 193.03 samples/s 2024-11-23 16:42:31,575 - Epoch 193, Loss: 0.0200, Throughput: 193.00 samples/s 2024-11-23 16:42:38,052 - Epoch 194, Loss: 0.0200, Throughput: 193.58 samples/s 2024-11-23 16:42:44,245 - Epoch 195, Loss: 0.0201, Throughput: 192.55 samples/s 2024-11-23 16:42:50,652 - Epoch 196, Loss: 0.0202, Throughput: 193.59 samples/s 2024-11-23 16:42:56,832 - Epoch 197, Loss: 0.0197, Throughput: 192.89 samples/s 2024-11-23 16:43:03,272 - Epoch 198, Loss: 0.0200, Throughput: 191.32 samples/s 2024-11-23 16:43:09,863 - Epoch 199, Loss: 0.0196, Throughput: 192.85 samples/s 2024-11-23 16:43:13,906 - Average Throughput: 191.67 samples/s 2024-11-23 16:43:13,906 - Average Step Time: 0.2001s 2024-11-23 16:43:13,906 - Average Memory Usage: 17170.99 MB 2024-11-23 16:43:13,906 - Average Power Usage: 155.80 W 2024-11-23 16:43:13,906 - Average Epoch Duration: 2.86s 2024-11-23 16:43:13,906 - Cost for 10k steps: 4.45 yuan, Total Time: 0.56 hours