-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathtime_stats
More file actions
1611 lines (1611 loc) · 48.7 KB
/
time_stats
File metadata and controls
1611 lines (1611 loc) · 48.7 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
time to load data: 1.291800
time for layer 1: 0.504165
time to compute cost: 0.011752
time for layer 0 backward: 0.896909
time for iteration 0: 1
time for layer 1: 0.467938
time to compute cost: 0.011048
time for layer 0 backward: 0.859391
time for iteration 1: 1
time for layer 1: 0.640718
time to compute cost: 0.011097
time for layer 0 backward: 0.842786
time for iteration 2: 1
time for layer 1: 0.476936
time to compute cost: 0.014569
time for layer 0 backward: 0.916284
time for iteration 3: 1
time for layer 1: 0.549470
time to compute cost: 0.012260
time for layer 0 backward: 0.802521
time for iteration 4: 1
time for layer 1: 0.461436
time to compute cost: 0.012530
time for layer 0 backward: 0.822232
time for iteration 5: 1
time for layer 1: 0.462847
time to compute cost: 0.010709
time for layer 0 backward: 0.912425
time for iteration 6: 1
time for layer 1: 0.520109
time to compute cost: 0.014847
time for layer 0 backward: 0.829137
time for iteration 7: 1
time for layer 1: 0.450326
time to compute cost: 0.013441
time for layer 0 backward: 0.830999
time for iteration 8: 1
time for layer 1: 0.474780
time to compute cost: 0.011511
time for layer 0 backward: 0.835166
time for iteration 9: 1
time for layer 1: 0.498843
time to compute cost: 0.012784
time for layer 0 backward: 0.824927
time for iteration 10: 1
time for layer 1: 0.460064
time to compute cost: 0.012500
time for layer 0 backward: 0.799138
time for iteration 11: 1
time for layer 1: 0.457477
time to compute cost: 0.009190
time for layer 0 backward: 0.872432
time for iteration 12: 1
time for layer 1: 0.541299
time to compute cost: 0.015466
time for layer 0 backward: 0.848268
time for iteration 13: 1
time for layer 1: 0.461293
time to compute cost: 0.015329
time for layer 0 backward: 0.807747
time for iteration 14: 1
time for layer 1: 0.450444
time to compute cost: 0.014895
time for layer 0 backward: 0.817397
time for iteration 15: 1
time for layer 1: 0.462807
time to compute cost: 0.015459
time for layer 0 backward: 0.791775
time for iteration 16: 1
time for layer 1: 0.515686
time to compute cost: 0.013687
time for layer 0 backward: 0.886295
time for iteration 17: 1
time for layer 1: 0.459916
time to compute cost: 0.014806
time for layer 0 backward: 0.840784
time for iteration 18: 1
time for layer 1: 0.582513
time to compute cost: 0.015999
time for layer 0 backward: 0.847421
time for iteration 19: 1
time for layer 1: 0.533678
time to compute cost: 0.011689
time for layer 0 backward: 0.880136
time for iteration 20: 1
time for layer 1: 0.465503
time to compute cost: 0.013903
time for layer 0 backward: 0.860026
time for iteration 21: 1
time for layer 1: 0.466851
time to compute cost: 0.011908
time for layer 0 backward: 0.848431
time for iteration 22: 1
time for layer 1: 0.522639
time to compute cost: 0.018178
time for layer 0 backward: 0.865567
time for iteration 23: 1
time for layer 1: 0.456230
time to compute cost: 0.014687
time for layer 0 backward: 0.791491
time for iteration 24: 1
time for layer 1: 0.523271
time to compute cost: 0.016405
time for layer 0 backward: 0.806274
time for iteration 25: 1
time for layer 1: 0.461545
time to compute cost: 0.013677
time for layer 0 backward: 0.887600
time for iteration 26: 1
time for layer 1: 0.458078
time to compute cost: 0.015006
time for layer 0 backward: 0.805663
time for iteration 27: 1
time for layer 1: 0.474623
time to compute cost: 0.011908
time for layer 0 backward: 0.796922
time for iteration 28: 1
time for layer 1: 0.443889
time to compute cost: 0.016515
time for layer 0 backward: 0.806709
time for iteration 29: 1
time for layer 1: 0.458478
time to compute cost: 0.012233
time for layer 0 backward: 0.899759
time for iteration 30: 1
time for layer 1: 0.468482
time to compute cost: 0.014736
time for layer 0 backward: 0.818710
time for iteration 31: 1
time for layer 1: 0.451267
time to compute cost: 0.010439
time for layer 0 backward: 0.786278
time for iteration 32: 1
time for layer 1: 0.527260
time to compute cost: 0.015803
time for layer 0 backward: 0.843316
time for iteration 33: 1
time for layer 1: 0.458186
time to compute cost: 0.015724
time for layer 0 backward: 0.810989
time for iteration 34: 1
time for layer 1: 0.448491
time to compute cost: 0.013464
time for layer 0 backward: 0.850656
time for iteration 35: 1
time for layer 1: 0.459329
time to compute cost: 0.013574
time for layer 0 backward: 0.789716
time for iteration 36: 1
time for layer 1: 0.527243
time to compute cost: 0.018490
time for layer 0 backward: 0.856915
time for iteration 37: 1
time for layer 1: 0.449790
time to compute cost: 0.013440
time for layer 0 backward: 0.839071
time for iteration 38: 1
time for layer 1: 0.457556
time to compute cost: 0.014499
time for layer 0 backward: 0.818253
time for iteration 39: 1
time for layer 1: 0.515400
time to compute cost: 0.012514
time for layer 0 backward: 0.849938
time for iteration 40: 1
time for layer 1: 0.442571
time to compute cost: 0.013924
time for layer 0 backward: 0.819367
time for iteration 41: 1
time for layer 1: 0.459540
time to compute cost: 0.012756
time for layer 0 backward: 0.806468
time for iteration 42: 1
time for layer 1: 0.459827
time to compute cost: 0.014031
time for layer 0 backward: 0.867952
time for iteration 43: 1
time for layer 1: 0.683987
time to compute cost: 0.014633
time for layer 0 backward: 0.951088
time for iteration 44: 1
time for layer 1: 0.708007
time to compute cost: 0.022344
time for layer 0 backward: 1.013864
time for iteration 45: 2
time for layer 1: 0.508083
time to compute cost: 0.012313
time for layer 0 backward: 0.946468
time for iteration 46: 1
time for layer 1: 0.535857
time to compute cost: 0.014419
time for layer 0 backward: 0.934707
time for iteration 47: 1
time for layer 1: 0.622969
time to compute cost: 0.010872
time for layer 0 backward: 0.733483
time for iteration 48: 1
time for layer 1: 0.431025
time to compute cost: 0.011965
time for layer 0 backward: 0.855835
time for iteration 49: 1
time for layer 1: 0.473537
time to compute cost: 0.010977
time for layer 0 backward: 0.813245
time for iteration 50: 1
time for layer 1: 0.447162
time to compute cost: 0.010766
time for layer 0 backward: 0.831609
time for iteration 51: 1
time for layer 1: 0.426137
time to compute cost: 0.011068
time for layer 0 backward: 0.840315
time for iteration 52: 1
time for layer 1: 0.494742
time to compute cost: 0.011481
time for layer 0 backward: 0.782604
time for iteration 53: 1
time for layer 1: 0.430415
time to compute cost: 0.010430
time for layer 0 backward: 0.858589
time for iteration 54: 1
time for layer 1: 0.572085
time to compute cost: 0.013878
time for layer 0 backward: 0.983658
time for iteration 55: 1
time for layer 1: 0.546741
time to compute cost: 0.012305
time for layer 0 backward: 0.887171
time for iteration 56: 1
time for layer 1: 0.486250
time to compute cost: 0.028249
time for layer 0 backward: 0.893499
time for iteration 57: 1
time for layer 1: 0.525670
time to compute cost: 0.012670
time for layer 0 backward: 1.014915
time for iteration 58: 1
time for layer 1: 0.574301
time to compute cost: 0.012405
time for layer 0 backward: 0.788018
time for iteration 59: 1
time for layer 1: 0.429477
time to compute cost: 0.010714
time for layer 0 backward: 0.776010
time for iteration 60: 1
time for layer 1: 0.449613
time to compute cost: 0.011836
time for layer 0 backward: 0.760760
time for iteration 61: 1
time for layer 1: 0.425126
time to compute cost: 0.011244
time for layer 0 backward: 0.939823
time for iteration 62: 1
time for layer 1: 0.511377
time to compute cost: 0.012886
time for layer 0 backward: 0.817490
time for iteration 63: 1
time for layer 1: 0.555032
time to compute cost: 0.010148
time for layer 0 backward: 0.877296
time for iteration 64: 1
time for layer 1: 0.462113
time to compute cost: 0.012337
time for layer 0 backward: 0.780819
time for iteration 65: 1
time for layer 1: 0.439439
time to compute cost: 0.010953
time for layer 0 backward: 0.920250
time for iteration 66: 1
time for layer 1: 0.467948
time to compute cost: 0.011972
time for layer 0 backward: 0.802839
time for iteration 67: 1
time for layer 1: 0.415618
time to compute cost: 0.014835
time for layer 0 backward: 0.915369
time for iteration 68: 1
time for layer 1: 0.467663
time to compute cost: 0.012454
time for layer 0 backward: 0.825822
time for iteration 69: 1
time for layer 1: 0.454591
time to compute cost: 0.010520
time for layer 0 backward: 0.760481
time for iteration 70: 1
time for layer 1: 0.483148
time to compute cost: 0.012771
time for layer 0 backward: 0.812415
time for iteration 71: 1
time for layer 1: 0.419255
time to compute cost: 0.011940
time for layer 0 backward: 0.748598
time for iteration 72: 1
time for layer 1: 0.477530
time to compute cost: 0.013684
time for layer 0 backward: 0.830483
time for iteration 73: 1
time for layer 1: 0.462435
time to compute cost: 0.011758
time for layer 0 backward: 0.785873
time for iteration 74: 1
time for layer 1: 0.414903
time to compute cost: 0.012697
time for layer 0 backward: 0.731358
time for iteration 75: 1
time for layer 1: 0.414862
time to compute cost: 0.019594
time for layer 0 backward: 0.815396
time for iteration 76: 1
time for layer 1: 0.416162
time to compute cost: 0.012750
time for layer 0 backward: 0.719499
time for iteration 77: 1
time for layer 1: 0.457499
time to compute cost: 0.012334
time for layer 0 backward: 0.835160
time for iteration 78: 1
time for layer 1: 0.424908
time to compute cost: 0.013412
time for layer 0 backward: 0.722525
time for iteration 79: 1
time for layer 1: 0.454957
time to compute cost: 0.012742
time for layer 0 backward: 0.755372
time for iteration 80: 1
time for layer 1: 0.451402
time to compute cost: 0.014191
time for layer 0 backward: 0.763353
time for iteration 81: 1
time for layer 1: 0.414300
time to compute cost: 0.011885
time for layer 0 backward: 0.727500
time for iteration 82: 1
time for layer 1: 0.409283
time to compute cost: 0.011533
time for layer 0 backward: 0.821062
time for iteration 83: 1
time for layer 1: 0.588499
time to compute cost: 0.015429
time for layer 0 backward: 0.920361
time for iteration 84: 1
time for layer 1: 0.497459
time to compute cost: 0.013758
time for layer 0 backward: 0.794978
time for iteration 85: 1
time for layer 1: 0.436329
time to compute cost: 0.010867
time for layer 0 backward: 0.817045
time for iteration 86: 1
time for layer 1: 0.440699
time to compute cost: 0.028229
time for layer 0 backward: 0.907990
time for iteration 87: 1
time for layer 1: 0.686083
time to compute cost: 0.012063
time for layer 0 backward: 0.890253
time for iteration 88: 1
time for layer 1: 0.493782
time to compute cost: 0.014938
time for layer 0 backward: 0.806737
time for iteration 89: 1
time for layer 1: 0.511901
time to compute cost: 0.011799
time for layer 0 backward: 0.834116
time for iteration 90: 1
time for layer 1: 0.505367
time to compute cost: 0.013060
time for layer 0 backward: 0.827567
time for iteration 91: 1
time for layer 1: 0.485507
time to compute cost: 0.012074
time for layer 0 backward: 0.760863
time for iteration 92: 1
time for layer 1: 0.480971
time to compute cost: 0.013747
time for layer 0 backward: 0.951221
time for iteration 93: 1
time for layer 1: 0.494741
time to compute cost: 0.012988
time for layer 0 backward: 0.830375
time for iteration 94: 1
time for layer 1: 0.424279
time to compute cost: 0.014261
time for layer 0 backward: 0.750079
time for iteration 95: 1
time for layer 1: 0.438525
time to compute cost: 0.012384
time for layer 0 backward: 0.800283
time for iteration 96: 1
time for layer 1: 0.569070
time to compute cost: 0.013575
time for layer 0 backward: 0.945059
time for iteration 97: 1
time for layer 1: 0.445414
time to compute cost: 0.013813
time for layer 0 backward: 0.795450
time for iteration 98: 1
time for layer 1: 0.441611
time to compute cost: 0.014643
time for layer 0 backward: 0.794742
time for iteration 99: 1
time for layer 1: 0.469412
time to compute cost: 0.011699
time for layer 0 backward: 0.895826
time for iteration 100: 1
time for layer 1: 0.483532
time to compute cost: 0.013465
time for layer 0 backward: 0.798212
time for iteration 101: 1
time for layer 1: 0.493127
time to compute cost: 0.014416
time for layer 0 backward: 0.746682
time for iteration 102: 1
time for layer 1: 0.463767
time to compute cost: 0.013330
time for layer 0 backward: 0.775228
time for iteration 103: 1
time for layer 1: 0.489408
time to compute cost: 0.014409
time for layer 0 backward: 0.831969
time for iteration 104: 1
time for layer 1: 0.433281
time to compute cost: 0.013914
time for layer 0 backward: 0.725913
time for iteration 105: 1
time for layer 1: 0.426578
time to compute cost: 0.013404
time for layer 0 backward: 0.746268
time for iteration 106: 1
time for layer 1: 0.491702
time to compute cost: 0.015559
time for layer 0 backward: 0.770718
time for iteration 107: 1
time for layer 1: 0.408825
time to compute cost: 0.012492
time for layer 0 backward: 0.739132
time for iteration 108: 1
time for layer 1: 0.429187
time to compute cost: 0.012797
time for layer 0 backward: 0.707565
time for iteration 109: 1
time for layer 1: 0.441769
time to compute cost: 0.012320
time for layer 0 backward: 0.727491
time for iteration 110: 1
time for layer 1: 0.464497
time to compute cost: 0.013788
time for layer 0 backward: 0.759179
time for iteration 111: 1
time for layer 1: 0.421430
time to compute cost: 0.014967
time for layer 0 backward: 0.798786
time for iteration 112: 1
time for layer 1: 0.430493
time to compute cost: 0.015358
time for layer 0 backward: 0.802577
time for iteration 113: 1
time for layer 1: 0.494955
time to compute cost: 0.013089
time for layer 0 backward: 0.824544
time for iteration 114: 1
time for layer 1: 0.464520
time to compute cost: 0.012991
time for layer 0 backward: 0.787922
time for iteration 115: 1
time for layer 1: 0.431694
time to compute cost: 0.013702
time for layer 0 backward: 0.757309
time for iteration 116: 1
time for layer 1: 0.416390
time to compute cost: 0.012238
time for layer 0 backward: 0.745352
time for iteration 117: 1
time for layer 1: 0.528960
time to compute cost: 0.011131
time for layer 0 backward: 0.864908
time for iteration 118: 1
time for layer 1: 0.532230
time to compute cost: 0.015329
time for layer 0 backward: 0.796358
time for iteration 119: 1
time for layer 1: 0.493008
time to compute cost: 0.012374
time for layer 0 backward: 0.791247
time for iteration 120: 1
time for layer 1: 0.526261
time to compute cost: 0.028468
time for layer 0 backward: 0.938905
time for iteration 121: 1
time for layer 1: 0.474600
time to compute cost: 0.014618
time for layer 0 backward: 0.825402
time for iteration 122: 1
time for layer 1: 0.505905
time to compute cost: 0.022253
time for layer 0 backward: 0.972915
time for iteration 123: 1
time for layer 1: 0.558903
time to compute cost: 0.012154
time for layer 0 backward: 0.831711
time for iteration 124: 1
time for layer 1: 0.484056
time to compute cost: 0.014699
time for layer 0 backward: 0.814978
time for iteration 125: 1
time for layer 1: 0.459761
time to compute cost: 0.010431
time for layer 0 backward: 0.784990
time for iteration 126: 1
time for layer 1: 0.473711
time to compute cost: 0.012891
time for layer 0 backward: 0.804155
time for iteration 127: 1
time for layer 1: 0.550785
time to compute cost: 0.016600
time for layer 0 backward: 0.858237
time for iteration 128: 1
time for layer 1: 0.506669
time to compute cost: 0.013782
time for layer 0 backward: 0.860382
time for iteration 129: 1
time for layer 1: 0.454479
time to compute cost: 0.014483
time for layer 0 backward: 0.803046
time for iteration 130: 1
time for layer 1: 0.492292
time to compute cost: 0.017917
time for layer 0 backward: 0.857758
time for iteration 131: 1
time for layer 1: 0.479279
time to compute cost: 0.012494
time for layer 0 backward: 0.779445
time for iteration 132: 1
time for layer 1: 0.540950
time to compute cost: 0.015943
time for layer 0 backward: 0.808825
time for iteration 133: 1
time for layer 1: 0.431659
time to compute cost: 0.012753
time for layer 0 backward: 0.837000
time for iteration 134: 1
time for layer 1: 0.564457
time to compute cost: 0.036851
time for layer 0 backward: 0.871369
time for iteration 135: 1
time for layer 1: 0.445670
time to compute cost: 0.012704
time for layer 0 backward: 0.884041
time for iteration 136: 1
time for layer 1: 0.442792
time to compute cost: 0.013265
time for layer 0 backward: 0.754459
time for iteration 137: 1
time for layer 1: 0.542104
time to compute cost: 0.012695
time for layer 0 backward: 0.918953
time for iteration 138: 1
time for layer 1: 0.534227
time to compute cost: 0.023269
time for layer 0 backward: 0.759507
time for iteration 139: 1
time for layer 1: 0.437336
time to compute cost: 0.023798
time for layer 0 backward: 0.827772
time for iteration 140: 1
time for layer 1: 0.713617
time to compute cost: 0.013968
time for layer 0 backward: 0.864702
time for iteration 141: 1
time for layer 1: 0.473924
time to compute cost: 0.012649
time for layer 0 backward: 0.812614
time for iteration 142: 1
time for layer 1: 0.462442
time to compute cost: 0.022396
time for layer 0 backward: 0.794842
time for iteration 143: 1
time for layer 1: 0.496480
time to compute cost: 0.010748
time for layer 0 backward: 0.931334
time for iteration 144: 1
time for layer 1: 0.467699
time to compute cost: 0.046157
time for layer 0 backward: 0.845618
time for iteration 145: 1
time for layer 1: 0.447314
time to compute cost: 0.011783
time for layer 0 backward: 0.766025
time for iteration 146: 1
time for layer 1: 0.555315
time to compute cost: 0.012622
time for layer 0 backward: 0.840477
time for iteration 147: 1
time for layer 1: 0.458890
time to compute cost: 0.011988
time for layer 0 backward: 0.795446
time for iteration 148: 1
time for layer 1: 0.405033
time to compute cost: 0.012018
time for layer 0 backward: 0.747659
time for iteration 149: 1
time for layer 1: 0.415997
time to compute cost: 0.012419
time for layer 0 backward: 0.747504
time for iteration 150: 1
time for layer 1: 0.463299
time to compute cost: 0.016058
time for layer 0 backward: 0.821579
time for iteration 151: 1
time for layer 1: 0.416454
time to compute cost: 0.011905
time for layer 0 backward: 0.752593
time for iteration 152: 1
time for layer 1: 0.440715
time to compute cost: 0.013452
time for layer 0 backward: 0.758459
time for iteration 153: 1
time for layer 1: 0.444349
time to compute cost: 0.010919
time for layer 0 backward: 0.757274
time for iteration 154: 1
time for layer 1: 0.482624
time to compute cost: 0.015004
time for layer 0 backward: 0.804574
time for iteration 155: 1
time for layer 1: 0.422437
time to compute cost: 0.011851
time for layer 0 backward: 0.760773
time for iteration 156: 1
time for layer 1: 0.443466
time to compute cost: 0.012869
time for layer 0 backward: 0.826287
time for iteration 157: 1
time for layer 1: 0.479926
time to compute cost: 0.028230
time for layer 0 backward: 0.771975
time for iteration 158: 1
time for layer 1: 0.414815
time to compute cost: 0.014025
time for layer 0 backward: 0.746126
time for iteration 159: 1
time for layer 1: 0.431663
time to compute cost: 0.012261
time for layer 0 backward: 0.792271
time for iteration 160: 1
time for layer 1: 0.424061
time to compute cost: 0.012680
time for layer 0 backward: 0.828153
time for iteration 161: 1
time for layer 1: 0.452050
time to compute cost: 0.013553
time for layer 0 backward: 0.777233
time for iteration 162: 1
time for layer 1: 0.422442
time to compute cost: 0.013998
time for layer 0 backward: 0.736056
time for iteration 163: 1
time for layer 1: 0.439154
time to compute cost: 0.011776
time for layer 0 backward: 0.928295
time for iteration 164: 1
time for layer 1: 0.451754
time to compute cost: 0.025696
time for layer 0 backward: 0.896797
time for iteration 165: 1
time for layer 1: 0.548537
time to compute cost: 0.017774
time for layer 0 backward: 1.017253
time for iteration 166: 1
time for layer 1: 0.538727
time to compute cost: 0.016125
time for layer 0 backward: 0.979698
time for iteration 167: 1
time for layer 1: 0.632167
time to compute cost: 0.014634
time for layer 0 backward: 0.859050
time for iteration 168: 1
time for layer 1: 0.420752
time to compute cost: 0.012818
time for layer 0 backward: 0.753225
time for iteration 169: 1
time for layer 1: 0.492321
time to compute cost: 0.012870
time for layer 0 backward: 0.745204
time for iteration 170: 1
time for layer 1: 0.418961
time to compute cost: 0.014231
time for layer 0 backward: 0.736840
time for iteration 171: 1
time for layer 1: 0.461422
time to compute cost: 0.013223
time for layer 0 backward: 0.797505
time for iteration 172: 1
time for layer 1: 0.428140
time to compute cost: 0.013043
time for layer 0 backward: 0.839904
time for iteration 173: 1
time for layer 1: 0.449186
time to compute cost: 0.012816
time for layer 0 backward: 0.749918
time for iteration 174: 1
time for layer 1: 0.462052
time to compute cost: 0.014616
time for layer 0 backward: 0.824765
time for iteration 175: 1
time for layer 1: 0.415474
time to compute cost: 0.013991
time for layer 0 backward: 0.750368
time for iteration 176: 1
time for layer 1: 0.456812
time to compute cost: 0.012572
time for layer 0 backward: 0.746333
time for iteration 177: 1
time for layer 1: 0.408078
time to compute cost: 0.011943
time for layer 0 backward: 0.809465
time for iteration 178: 1
time for layer 1: 0.464331
time to compute cost: 0.012389
time for layer 0 backward: 0.749487
time for iteration 179: 1
time for layer 1: 0.434741
time to compute cost: 0.027828
time for layer 0 backward: 0.751898
time for iteration 180: 1
time for layer 1: 0.455772
time to compute cost: 0.013612
time for layer 0 backward: 0.748914
time for iteration 181: 1
time for layer 1: 0.494272
time to compute cost: 0.012580
time for layer 0 backward: 0.857971
time for iteration 182: 1
time for layer 1: 0.425378
time to compute cost: 0.011749
time for layer 0 backward: 0.770558
time for iteration 183: 1
time for layer 1: 0.415804
time to compute cost: 0.019232
time for layer 0 backward: 0.747324
time for iteration 184: 1
time for layer 1: 0.423095
time to compute cost: 0.013693
time for layer 0 backward: 0.830826
time for iteration 185: 1
time for layer 1: 0.450888
time to compute cost: 0.012667
time for layer 0 backward: 0.710713
time for iteration 186: 1
time for layer 1: 0.428275
time to compute cost: 0.012834
time for layer 0 backward: 0.736648
time for iteration 187: 1
time for layer 1: 0.432530
time to compute cost: 0.010571
time for layer 0 backward: 0.764106
time for iteration 188: 1
time for layer 1: 0.421612
time to compute cost: 0.029370
time for layer 0 backward: 0.854408
time for iteration 189: 1
time for layer 1: 0.441585
time to compute cost: 0.013262
time for layer 0 backward: 0.727439
time for iteration 190: 1
time for layer 1: 0.404055
time to compute cost: 0.012953
time for layer 0 backward: 0.722857
time for iteration 191: 1
time for layer 1: 0.415230
time to compute cost: 0.012412
time for layer 0 backward: 0.729520
time for iteration 192: 1
time for layer 1: 0.438982
time to compute cost: 0.014667
time for layer 0 backward: 0.797277
time for iteration 193: 1
time for layer 1: 0.467567
time to compute cost: 0.011585
time for layer 0 backward: 0.724416
time for iteration 194: 1
time for layer 1: 0.444372
time to compute cost: 0.012757
time for layer 0 backward: 0.762535
time for iteration 195: 1
time for layer 1: 0.433141
time to compute cost: 0.016847
time for layer 0 backward: 0.781775
time for iteration 196: 1
time for layer 1: 0.481722
time to compute cost: 0.013696
time for layer 0 backward: 0.756053
time for iteration 197: 1
time for layer 1: 0.416097
time to compute cost: 0.013662
time for layer 0 backward: 0.739755
time for iteration 198: 1
time for layer 1: 0.430108
time to compute cost: 0.013101
time for layer 0 backward: 0.749538
time for iteration 199: 1
time for layer 1: 0.453426
time to compute cost: 0.012564
time for layer 0 backward: 0.861771
time for iteration 200: 1
time for layer 1: 0.449627
time to compute cost: 0.012084
time for layer 0 backward: 0.716178
time for iteration 201: 1
time for layer 1: 0.436843
time to compute cost: 0.013383
time for layer 0 backward: 0.750798
time for iteration 202: 1
time for layer 1: 0.420066
time to compute cost: 0.013059
time for layer 0 backward: 0.800852
time for iteration 203: 1
time for layer 1: 0.481603
time to compute cost: 0.013240
time for layer 0 backward: 0.805666
time for iteration 204: 1
time for layer 1: 0.429548
time to compute cost: 0.012901
time for layer 0 backward: 0.752838
time for iteration 205: 1
time for layer 1: 0.422038
time to compute cost: 0.012714
time for layer 0 backward: 0.723795
time for iteration 206: 1
time for layer 1: 0.423576
time to compute cost: 0.012539
time for layer 0 backward: 0.805527
time for iteration 207: 1
time for layer 1: 0.452968
time to compute cost: 0.014086
time for layer 0 backward: 0.775496
time for iteration 208: 1
time for layer 1: 0.432002
time to compute cost: 0.012738
time for layer 0 backward: 0.731006
time for iteration 209: 1
time for layer 1: 0.418121
time to compute cost: 0.011655
time for layer 0 backward: 0.752629
time for iteration 210: 1
time for layer 1: 0.489303
time to compute cost: 0.014007
time for layer 0 backward: 0.747408
time for iteration 211: 1
time for layer 1: 0.423951
time to compute cost: 0.012341
time for layer 0 backward: 0.786519
time for iteration 212: 1
time for layer 1: 0.411682
time to compute cost: 0.012383
time for layer 0 backward: 0.740770
time for iteration 213: 1
time for layer 1: 0.433172
time to compute cost: 0.012389
time for layer 0 backward: 0.760479
time for iteration 214: 1
time for layer 1: 0.420905
time to compute cost: 0.012625
time for layer 0 backward: 0.881169
time for iteration 215: 1
time for layer 1: 0.436078
time to compute cost: 0.011636
time for layer 0 backward: 0.763259
time for iteration 216: 1
time for layer 1: 0.427028
time to compute cost: 0.012640
time for layer 0 backward: 0.764827
time for iteration 217: 1
time for layer 1: 0.442694
time to compute cost: 0.011579
time for layer 0 backward: 0.852661
time for iteration 218: 1
time for layer 1: 0.521178
time to compute cost: 0.018955
time for layer 0 backward: 0.806430
time for iteration 219: 1
time for layer 1: 0.421252
time to compute cost: 0.018317
time for layer 0 backward: 0.816064
time for iteration 220: 1
time for layer 1: 0.444715
time to compute cost: 0.013675
time for layer 0 backward: 0.730779
time for iteration 221: 1
time for layer 1: 0.475916
time to compute cost: 0.013647
time for layer 0 backward: 0.862669
time for iteration 222: 1
time for layer 1: 0.502753
time to compute cost: 0.014372
time for layer 0 backward: 0.868723
time for iteration 223: 1
time for layer 1: 0.454021
time to compute cost: 0.013958
time for layer 0 backward: 0.803687
time for iteration 224: 1
time for layer 1: 0.429331
time to compute cost: 0.013642
time for layer 0 backward: 0.915481
time for iteration 225: 1
time for layer 1: 0.467789
time to compute cost: 0.011323
time for layer 0 backward: 0.819657
time for iteration 226: 1
time for layer 1: 0.479004
time to compute cost: 0.013357
time for layer 0 backward: 0.878959
time for iteration 227: 1
time for layer 1: 0.452741
time to compute cost: 0.011928
time for layer 0 backward: 0.877221
time for iteration 228: 1
time for layer 1: 0.451781
time to compute cost: 0.011758
time for layer 0 backward: 0.751277
time for iteration 229: 1
time for layer 1: 0.445552
time to compute cost: 0.031936
time for layer 0 backward: 0.721055
time for iteration 230: 1
time for layer 1: 0.457498
time to compute cost: 0.013147
time for layer 0 backward: 0.757988
time for iteration 231: 1
time for layer 1: 0.422072
time to compute cost: 0.013269
time for layer 0 backward: 0.852187
time for iteration 232: 1
time for layer 1: 0.437518
time to compute cost: 0.013653
time for layer 0 backward: 0.799858
time for iteration 233: 1
time for layer 1: 0.421518
time to compute cost: 0.013403
time for layer 0 backward: 0.780096
time for iteration 234: 1
time for layer 1: 0.420657
time to compute cost: 0.012880
time for layer 0 backward: 0.736460
time for iteration 235: 1
time for layer 1: 0.493718
time to compute cost: 0.013171
time for layer 0 backward: 0.853267
time for iteration 236: 1
time for layer 1: 0.415388
time to compute cost: 0.013078
time for layer 0 backward: 0.722298
time for iteration 237: 1
time for layer 1: 0.400494
time to compute cost: 0.009597
time for layer 0 backward: 0.744176
time for iteration 238: 1
time for layer 1: 0.473230
time to compute cost: 0.013426
time for layer 0 backward: 0.754127
time for iteration 239: 1
time for layer 1: 0.447290
time to compute cost: 0.016848
time for layer 0 backward: 0.755284
time for iteration 240: 1
time for layer 1: 0.427936
time to compute cost: 0.012671
time for layer 0 backward: 0.758876
time for iteration 241: 1
time for layer 1: 0.421652
time to compute cost: 0.011803
time for layer 0 backward: 0.731695
time for iteration 242: 1
time for layer 1: 0.473220
time to compute cost: 0.013473
time for layer 0 backward: 0.811532
time for iteration 243: 1
time for layer 1: 0.420661
time to compute cost: 0.011183
time for layer 0 backward: 0.738826
time for iteration 244: 1
time for layer 1: 0.410699
time to compute cost: 0.026395
time for layer 0 backward: 0.767189
time for iteration 245: 1
time for layer 1: 0.431859
time to compute cost: 0.011877
time for layer 0 backward: 0.861412
time for iteration 246: 1
time for layer 1: 0.531107
time to compute cost: 0.014720
time for layer 0 backward: 0.862714
time for iteration 247: 1
time for layer 1: 0.468303
time to compute cost: 0.011189
time for layer 0 backward: 0.834006
time for iteration 248: 1
time for layer 1: 0.438335
time to compute cost: 0.012653
time for layer 0 backward: 0.814390