================= CRYOSPARCW ======= 2021-02-05 04:53:10.481562 =========
Project P17 Job J433
Master jptitan Port 39002
===========================================================================
========= monitor process now starting main process
MAINPROCESS PID 795775
========= monitor process now waiting for main process
MAIN PID 795775
refine.newrun cryosparc_compute.jobs.jobregister
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
***************************************************************
Running job J433 of type nonuniform_refine_new
Running job on hostname %s jptitan
Allocated Resources : {'fixed': {'SSD': True}, 'hostname': 'jptitan', 'lane': 'default', 'lane_type': 'default', 'license': True, 'licenses_acquired': 1, 'slots': {'CPU': [0, 1, 2, 3], 'GPU': [0], 'RAM': [0, 1, 2]}, 'target': {'cache_path': '/scratch', 'cache_quota_mb': None, 'cache_reserve_mb': 10000, 'desc': None, 'gpus': [{'id': 0, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 1, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 2, 'mem': 11554324480, 'name': 'GeForce RTX 2080 Ti'}], 'hostname': 'jptitan', 'lane': 'default', 'monitor_port': None, 'name': 'jptitan', 'resource_fixed': {'SSD': True}, 'resource_slots': {'CPU': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63], 'GPU': [0, 1, 2], 'RAM': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]}, 'ssh_str': 'jparmache@jptitan', 'title': 'Worker node jptitan', 'type': 'node', 'worker_bin_path': '/data/software/cryosparc/cryosparc2_worker/bin/cryosparcw'}}
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
global compute_resid_pow with (500, 1, 2007, 81) 170
global compute_resid_pow with (500, 1, 2007, 81) 170
block size
256 grid size (500, 126, 1) block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 
8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (494, 1, 2007, 81) 170 block size 256 grid size (494, 126, 1) global compute_resid_pow with (494, 1, 672, 44) 170 block size 256 grid size (494, 42, 1) global compute_resid_pow with (494, 1, 224, 24) 170 block size 256 grid size (494, 14, 1) global compute_resid_pow with (494, 1, 80, 12) 170 block size 256 grid size (494, 5, 1) global compute_resid_pow with (494, 1, 32, 8) 170 block size 256 grid size (494, 2, 1) global compute_resid_pow with (494, 1, 16, 4) 170 block size 256 grid size (494, 1, 1) global compute_resid_pow with (494, 1, 8, 4) 170 block size 128 grid size (494, 8, 1) global compute_resid_pow with (494, 1, 19, 21) 170 block size 256 grid size (494, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 
170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat ========= sending heartbeat 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (494, 1, 2007, 81) 170 block size 256 grid size (494, 126, 1) global compute_resid_pow with (494, 1, 672, 44) 170 block size 256 grid size (494, 42, 1) global compute_resid_pow with (494, 1, 224, 24) 170 block size 256 grid size (494, 14, 1) global compute_resid_pow with (494, 1, 80, 12) 170 block size 256 grid size (494, 5, 1) global compute_resid_pow with (494, 1, 32, 8) 170 block size 256 grid size (494, 2, 1) global compute_resid_pow with (494, 1, 16, 4) 170 block size 256 grid size (494, 1, 1) global compute_resid_pow with (494, 1, 8, 4) 170 block size 128 grid size (494, 8, 1) global compute_resid_pow with (494, 1, 19, 21) 170 block size 256 grid size (494, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 
170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (494, 1, 2007, 81) 170 block size 256 grid size (494, 126, 1) global compute_resid_pow with (494, 1, 672, 44) 170 block size 256 grid size (494, 42, 1) global compute_resid_pow with (494, 1, 224, 24) 170 block size 256 grid size (494, 14, 1) global compute_resid_pow with (494, 1, 80, 12) 170 block size 256 grid size (494, 5, 1) global compute_resid_pow with (494, 1, 32, 8) 170 block size 256 grid size (494, 2, 1) global compute_resid_pow with (494, 1, 16, 4) 170 block size 256 grid size (494, 1, 1) global compute_resid_pow with (494, 1, 8, 4) 170 block size 128 grid size (494, 8, 1) global compute_resid_pow with (494, 1, 19, 21) 170 block size 256 grid size (494, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 170 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 
24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 170 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 170 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 170 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 170 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 170 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 170 block size 256 grid size (500, 2, 1) global compute_resid_pow with (494, 1, 2007, 81) 170 block size 256 grid size (494, 126, 1) global compute_resid_pow with (494, 1, 672, 44) 170 block size 256 grid size (494, 42, 1) global compute_resid_pow with (494, 1, 224, 24) 170 block size 256 grid size (494, 14, 1) global compute_resid_pow with (494, 1, 80, 12) 170 block size 256 grid size (494, 5, 1) global compute_resid_pow with (494, 1, 32, 8) 170 block size 256 grid size (494, 2, 1) global compute_resid_pow with (494, 1, 16, 4) 170 block size 256 grid size (494, 1, 1) global compute_resid_pow with (494, 1, 8, 4) 170 block size 128 grid size (494, 8, 1) global compute_resid_pow with (494, 1, 19, 21) 170 block size 256 grid size (494, 2, 1) FSC No-Mask... 0.143 at 28.447 radwn. 0.5 at 15.368 radwn. Took 2.172s. FSC Spherical Mask... 0.143 at 29.348 radwn. 0.5 at 16.297 radwn. Took 3.044s. FSC Loose Mask... 
========= sending heartbeat ========= sending heartbeat 0.143 at 32.183 radwn. 0.5 at 20.151 radwn. Took 10.792s. FSC Tight Mask... ========= sending heartbeat 0.143 at 43.020 radwn. 0.5 at 30.916 radwn. Took 10.080s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (272, 1, 2007, 81) 218 block size 256 grid size (272, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (272, 1, 672, 44) 894 block size 256 grid size (272, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (272, 1, 224, 24) 2906 block size 256 grid size (272, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (272, 1, 80, 12) 2906 block size 256 grid size (272, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (272, 1, 32, 8) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (272, 1, 16, 4) 2906 block size 256 grid size (272, 1, 1) global compute_resid_pow with (272, 1, 8, 4) 2906 block size 128 grid size (272, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with 
(500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid 
size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 
1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (272, 1, 2007, 81) 218 block size 256 grid size (272, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (272, 1, 672, 44) 894 block size 256 grid size (272, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (272, 1, 224, 24) 2906 block size 256 grid size (272, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (272, 1, 80, 12) 2906 block size 256 grid size (272, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (272, 1, 32, 8) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (272, 1, 16, 4) 2906 block size 256 grid size (272, 1, 1) global compute_resid_pow with (272, 1, 8, 4) 2906 block size 128 grid size (272, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with ========= sending heartbeat (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with 
(500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (272, 1, 2007, 81) 218 block size 256 grid size (272, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (272, 1, 672, 44) 894 block size 256 grid size (272, 42, 1) global compute_resid_pow with (272, 1, 224, 24) 2906 block size 256 grid size (272, 14, 1) global compute_resid_pow with (272, 1, 80, 12) 2906 block size 256 grid size (272, 5, 1) global compute_resid_pow with (272, 1, 32, 8) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (272, 1, 16, 4) 2906 block size 256 grid size (272, 1, 1) global compute_resid_pow with (272, 1, 8, 4) 2906 block size 128 grid size (272, 8, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with 
(500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid 
size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)
2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (272, 1, 2007, 81) 218 block size 256 grid size (272, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (272, 1, 672, 44) 894 block size 256 grid size (272, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2906 block size 256 grid size (500, 14, 1) global compute_resid_pow with (272, 1, 224, 24) 2906 block size 256 grid size (272, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2906 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (272, 1, 80, 12) 2906 block size 256 grid size (272, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 2906 block size 256 grid size (500, 1, 1) global compute_resid_pow with (272, 1, 32, 8) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 2906 block size 128 grid size (500, 8, 1) global compute_resid_pow with (272, 1, 16, 4) 2906 block size 256 grid size (272, 1, 1) global compute_resid_pow with (272, 1, 8, 4) 2906 block size 128 grid size (272, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size (500, 2, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (272, 1, 19, 21) 2906 block size 256 grid size (272, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2906 block size 256 grid size 
(500, 2, 1) FSC No-Mask... ========= sending heartbeat 0.143 at 46.279 radwn. 0.5 at 39.024 radwn. Took 5.628s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 51.192 radwn. 0.5 at 41.291 radwn. Took 4.285s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 68.437 radwn. 0.5 at 44.222 radwn. Took 18.494s. FSC Tight Mask... ========= sending heartbeat 0.143 at 73.681 radwn. 0.5 at 52.261 radwn. Took 12.898s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global
compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with 
(500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) ========= sending heartbeat 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with 
(500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (362, 1, 2007, 81) 218 block size 256 grid size (362, 126, 1) global compute_resid_pow with (362, 1, 672, 44) 894 block size 256 grid size (362, 42, 1) global compute_resid_pow 
with (362, 1, 224, 24) 3604 block size 256 grid size (362, 14, 1) global compute_resid_pow with (362, 1, 80, 12) 8522 block size 256 grid size (362, 5, 1) global compute_resid_pow with (362, 1, 32, 8) 8522 block size 256 grid size (362, 2, 1) global compute_resid_pow with (362, 1, 16, 4) 8522 block size 256 grid size (362, 1, 1) global compute_resid_pow with (362, 1, 8, 4) 8522 block size 128 grid size (362, 8, 1) global compute_resid_pow with (362, 1, 19, 21) 8522 block size 256 grid size (362, 2, 1) global compute_resid_pow with (362, 1, 19, 21) 8522 block size 256 grid size (362, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 
1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with 
(500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with 
(500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (363, 1, 2007, 81) 218 block size 256 grid size (363, 126, 1) global compute_resid_pow with (363, 1, 672, 44) 894 block size 256 grid size (363, 42, 1) global compute_resid_pow with (363, 1, 224, 24) 3604 block size 256 grid size (363, 14, 1) global compute_resid_pow with (363, 1, 80, 12) 8522 block size 256 grid size (363, 5, 1) global compute_resid_pow with (363, 1, 32, 8) 8522 block size 256 grid size (363, 2, 1) global compute_resid_pow with (363, 1, 16, 4) 8522 block size 256 grid size (363, 1, 1) global compute_resid_pow with (363, 1, 8, 4) 8522 block size 128 grid size (363, 8, 1) global compute_resid_pow with (363, 1, 19, 21) 8522 block size 256 grid size (363, 2, 1) global compute_resid_pow with (363, 1, 19, 21) 8522 block size 256 grid size (363, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid 
size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 
1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 
4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (362, 1, 2007, 81) 218 block size 256 grid size (362, 126, 1) global compute_resid_pow with (362, 1, 672, 44) 894 block size 256 grid size (362, 42, 1) global compute_resid_pow with (362, 1, 224, 24) 3604 block size 256 grid size (362, 14, 1) global compute_resid_pow with (362, 1, 80, 12) 8522 block size 256 grid size (362, 5, 1) global compute_resid_pow with (362, 1, 32, 8) 8522 block size 256 grid size (362, 2, 1) global compute_resid_pow with (362, 1, 16, 4) 8522 block size 256 grid size (362, 1, 1) global compute_resid_pow with (362, 1, 8, 4) 8522 block size 128 grid size (362, 8, 1) global compute_resid_pow with (362, 1, 19, 21) 8522 block size 256 grid size (362, 2, 1) global compute_resid_pow with (362, 1, 19, 21) 8522 block size 256 grid size (362, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 
1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 
block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid 
size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with 
(500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with========= sending heartbeat ========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 8522 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 8522 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 8522 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 8522 block size 256 grid size (500, 2, 1) global compute_resid_pow with (363, 1, 2007, 81) 218 block size 256 grid size (363, 126, 1) global compute_resid_pow with (363, 1, 672, 44) 894 block size 256 grid size (363, 42, 1) global compute_resid_pow with (363, 1, 224, 24) 3604 block size 256 grid size (363, 14, 1) global compute_resid_pow with (363, 1, 80, 12) 8522 block size 256 grid size (363, 5, 1) global compute_resid_pow with (363, 1, 32, 8) 8522 block size 256 grid size (363, 2, 1) global compute_resid_pow with (363, 1, 16, 4) 8522 block size 256 grid size (363, 1, 1) global compute_resid_pow with (363, 1, 8, 4) 8522 block size 128 grid size (363, 8, 1) global compute_resid_pow with (363, 1, 19, 21) 8522 block size 256 grid size (363, 2, 1) global compute_resid_pow with (363, 1, 19, 21) 8522 block size 256 
grid size (363, 2, 1) FSC No-Mask... 0.143 at 77.787 radwn. 0.5 at 48.463 radwn. Took 2.660s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 85.496 radwn. 0.5 at 67.538 radwn. Took 3.808s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 92.524 radwn. 0.5 at 74.153 radwn. Took 18.973s. FSC Tight Mask... ========= sending heartbeat 0.143 at 96.954 radwn. 0.5 at 82.657 radwn. Took 13.420s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (400, 1, 2007, 81) 218 block size 256 grid size (400, 126, 1) global compute_resid_pow with (400, 1, 672, 44) 894 block size 256 grid size (400, 42, 1) global compute_resid_pow with (400, 1, 224, 24) 3604 block size 256 grid size (400, 14, 1) global compute_resid_pow with (400, 1, 80, 12) 14456 block size 256 grid size (400, 5, 1) global compute_resid_pow with (400, 1, 32, 8) 14756 block size 256 grid size (400, 2, 1) global compute_resid_pow with (400, 1, 16, 4) 14756 block size 256 grid size (400, 1, 1) global compute_resid_pow with (400, 1, 8, 4) 14756 block size 128 grid size (400, 8, 1) global compute_resid_pow with (400, 1, 19, 21) 14756 block size 256 grid size (400, 2, 1) global compute_resid_pow with (400, 1, 19, 21) 14756 block size 256 grid size (400, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 
5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (401, 1, 2007, 81) 218 block size 256 grid size (401, 126, 1) global compute_resid_pow with (401, 1, 672, 44) 894 block size 256 grid size (401, 42, 1) global compute_resid_pow with (401, 1, 224, 24) 3604 block size 256 grid size (401, 14, 1) global compute_resid_pow with (401, 1, 80, 12) 14456 block size 256 grid size (401, 5, 1) global compute_resid_pow with (401, 1, 32, 8) 14756 block 
size 256 grid size (401, 2, 1) global compute_resid_pow with (401, 1, 16, 4) 14756 block size 256 grid size (401, 1, 1) global compute_resid_pow with (401, 1, 8, 4) 14756 block size 128 grid size (401, 8, 1) global compute_resid_pow with (401, 1, 19, 21) 14756 block size 256 grid size (401, 2, 1) global compute_resid_pow with (401, 1, 19, 21) 14756 block size 256 grid size (401, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size========= sending heartbeat (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1)========= sending 
heartbeat global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (400, 1, 2007, 81) 218 block size 256 grid size (400, 126, 1) global compute_resid_pow with (400, 1, 672, 44) 894 block size 256 grid size (400, 42, 1) global compute_resid_pow with (400, 1, 224, 24) 3604 block size 256 grid size (400, 14, 1) global compute_resid_pow with (400, 1, 80, 12) 14456 block size 256 grid size (400, 5, 1) global compute_resid_pow with (400, 1, 32, 8) 14756 block size 256 grid size (400, 2, 1) global compute_resid_pow with (400, 1, 16, 4) 14756 block size 256 grid size (400, 1, 1) global compute_resid_pow with (400, 1, 8, 4) 14756 block size 128 grid size (400, 8, 1) global compute_resid_pow with (400, 1, 19, 21) 14756 block size 256 grid size (400, 2, 1) global compute_resid_pow with (400, 1, 19, 21) 14756 block size 256 grid size (400, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global
compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 14756 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 14756 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 14756 block size 256 grid size (500, 2, 1) global compute_resid_pow with (401, 1, 2007, 81) 218 block size 256 grid size (401, 126, 1) global compute_resid_pow with (401, 1, 672, 44) 894 block size 256 grid size (401, 42, 1) global compute_resid_pow with (401, 1, 224, 24) 3604 block size 256 grid size (401, 14, 1) global compute_resid_pow with (401, 1, 80, 12) 14456 block size 256 grid size (401, 5, 1) global compute_resid_pow with (401, 1, 32, 8) 14756 block size 256 grid size (401, 2, 1) global compute_resid_pow with (401, 1, 16, 4) 14756 block size 256 grid size (401, 1, 1) global compute_resid_pow with (401, 1, 8, 4) 14756 block size 128 grid size (401, 8, 1) global compute_resid_pow with (401, 1, 19, 21) 14756 block size 256 grid size (401, 2, 1) global compute_resid_pow with (401, 1, 19, 21) 14756 block size 256 grid size (401, 2, 1)
FSC No-Mask... 0.143 at 89.803 radwn. 0.5 at 72.512 radwn. Took 6.730s.
FSC Spherical Mask... 0.143 at 95.018 radwn. 0.5 at 78.318 radwn. Took 5.771s.
========= sending heartbeat ========= sending heartbeat
FSC Loose Mask... 0.143 at 98.595 radwn. 0.5 at 87.563 radwn. Took 17.312s.
========= sending heartbeat
FSC Tight Mask... 0.143 at 103.200 radwn. 0.5 at 94.626 radwn. Took 13.386s.
========= sending heartbeat ========= sending heartbeat
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 
1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (57, 1, 2007, 81) 218 block size 256 grid size (57, 126, 1) global compute_resid_pow with (57, 1, 672, 44) 894 block size 256 grid size (57, 42, 1) global compute_resid_pow with (57, 1, 224, 24) 3604 block size 256 grid size (57, 14, 1) global compute_resid_pow with (57, 1, 80, 12) 14456 block size 256 grid size (57, 5, 1) global compute_resid_pow with (57, 1, 32, 8) 16726 block size 256 grid size (57, 2, 1) global compute_resid_pow with (57, 1, 16, 4) 16726 block size 256 grid size (57, 1, 1) global compute_resid_pow with (57, 1, 8, 4) 16726 block size 128 grid size (57, 8, 1) global compute_resid_pow with (57, 1, 19, 21) 16726 block size========= sending heartbeat 256 grid size (57, 2, 1) global compute_resid_pow with (57, 1, 19, 21) 16726 block size 256 grid size (57, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with 
(500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size========= sending heartbeat (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat ========= sending heartbeat (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (58, 1, 2007, 81) 218 block size 256 grid size (58, 126, 1) global compute_resid_pow with (58, 1, 672, 44) 894 block size 256 grid size (58, 42, 1) global compute_resid_pow with (58, 1, 224, 24) 3604 block size 256 grid size (58, 14, 1) global compute_resid_pow with (58, 1, 80, 12) 14456 block size 256 grid size (58, 5, 1) global compute_resid_pow with (58, 1, 32, 8) 16726 block size 256 grid size (58, 2, 1) global compute_resid_pow with (58, 1, 16, 4) 16726 block size 256 grid size (58, 1, 1) global compute_resid_pow with (58, 1, 8, 4) 16726 block size 128 grid size (58, 8, 1) global compute_resid_pow with (58, 1, 19, 21) 16726 block size 256 grid size (58, 2, 1) global compute_resid_pow with (58, 1, 19, 21) 16726 block size 256 grid size (58, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= 
sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (57, 1, 2007, 81) 218 block size 256 grid size (57, 126, 1) global compute_resid_pow with (57, 1, 672, 44) 894 block size 256 grid size (57, 42, 1) global compute_resid_pow with (57, 1, 224, 24) 3604 block size 256 grid size (57, 14, 1) global compute_resid_pow with (57, 1, 80, 12) 14456 block size 256 grid size (57, 5, 1) global compute_resid_pow with (57, 1, 32, 8) 16726 block size 256 grid size (57, 2, 1) global compute_resid_pow with (57, 1, 16, 4) 16726 block size 256 grid size (57, 1, 1) global compute_resid_pow with (57, 1, 8, 4) 16726 block size 128 grid size (57, 8, 1) global compute_resid_pow with (57, 1, 19, 21) 16726 block size 256 grid size (57, 2, 1) global compute_resid_pow with (57, 1, 19, 21) 16726 block size 256 grid size (57, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (58, 1, 2007, 81) 218 block size 256 grid size (58, 126, 1) global compute_resid_pow with (58, 1, 672, 44) 894 block size 256 grid size (58, 42, 1) global compute_resid_pow with (58, 1, 224, 24) 3604 block size 256 grid size (58, 14, 1) global compute_resid_pow with (58, 1, 80, 12) 14456 block size 256 grid size (58, 5, 1) global compute_resid_pow with (58, 1, 32, 8) 16726 block size 256 grid size (58, 2, 1) global compute_resid_pow with (58, 
1, 16, 4) 16726 block size 256 grid size (58, 1, 1) global compute_resid_pow with (58, 1, 8, 4) 16726 block size 128 grid size (58, 8, 1) global compute_resid_pow with (58, 1, 19, 21) 16726 block size 256 grid size (58, 2, 1) global compute_resid_pow with (58, 1, 19, 21) 16726 block size 256 grid size (58, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16726 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16726 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16726 block size 256 grid size (500, 2, 1) FSC No-Mask... ========= sending heartbeat 0.143 at 94.159 radwn. 0.5 at 73.913 radwn. Took 4.431s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 97.547 radwn. 0.5 at 80.768 radwn. Took 5.906s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 100.796 radwn. 0.5 at 93.304 radwn. Took 18.995s. FSC Tight Mask... ========= sending heartbeat 0.143 at 105.335 radwn. 0.5 at 97.841 radwn. Took 14.271s. 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 
1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 17430 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 17430 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 
block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid 
size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size 
(500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 
block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 
19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 
256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 17430 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 17430 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 
1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size========= sending heartbeat (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 
block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 17430 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 17430 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 
19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 
19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17430 block size 256 grid size (500, 1, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 17430 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17430 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 17430 block size 256 grid size (495, 1, 
1) global compute_resid_pow with (495, 1, 8, 4) 17430 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 17430 block size 256 grid size (495, 2, 1) FSC No-Mask... ========= sending heartbeat 0.143 at 97.264 radwn. 0.5 at 79.345 radwn. Took 7.902s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.345 radwn. 0.5 at 90.463 radwn. Took 5.796s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 104.084 radwn. 0.5 at 96.769 radwn. Took 18.324s. FSC Tight Mask... ========= sending heartbeat 0.143 at 109.554 radwn. 0.5 at 99.785 radwn. Took 14.703s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 109.041 radwn. 0.5 at 99.691 radwn. Took 26.812s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 
18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 
5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 
18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 
21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1)========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18668 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18668 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 
18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18668 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18668 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid 
size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size========= sending heartbeat 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18668 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18668 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size========= sending heartbeat (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat ========= sending heartbeat 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 
1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18668 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18668 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18668 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18668 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18668 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18668 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18668 block size 256 grid size (495, 2, 1) FSC No-Mask... 0.143 at 97.654 radwn. 0.5 at 79.428 radwn. Took 5.871s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.670 radwn. 0.5 at 91.972 radwn. Took 5.793s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 104.651 radwn. 0.5 at 97.240 radwn. Took 17.575s. FSC Tight Mask... ========= sending heartbeat 0.143 at 110.096 radwn. 0.5 at 100.121 radwn. Took 13.748s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 109.936 radwn. 0.5 at 99.973 radwn. Took 28.654s. 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size 
(500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 
block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18980 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18980 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 
5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size========= sending heartbeat (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size 
(500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18980 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18980 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 
18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size 
(500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 
5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 
18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 
21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18980 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18980 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21)========= sending heartbeat 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size========= sending heartbeat (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 
21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow 
with (500, 1, 16, 4) 18980 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18980 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18980 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 18980 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 18980 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 18980 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 18980 block size 256 grid size (495, 2, 1) FSC No-Mask... ========= sending heartbeat 0.143 at 97.827 radwn. 0.5 at 79.725 radwn. Took 4.278s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.859 radwn. 0.5 at 92.740 radwn. Took 4.410s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 105.200 radwn. 0.5 at 97.460 radwn. Took 20.056s. FSC Tight Mask... ========= sending heartbeat 0.143 at 110.369 radwn. 0.5 at 100.237 radwn. Took 13.183s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 110.409 radwn. 0.5 at 100.190 radwn. Took 27.955s. 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/site-packages/skcuda/cublas.py:284: UserWarning: creating CUBLAS context to get version number warnings.warn('creating CUBLAS context to get version number') /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. 
self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: divide by zero encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 ========= sending heartbeat block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 ========= sending heartbeat block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 
1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 
1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 19148 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 19148 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 
19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 
1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 
block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 
256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 19148 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 19148 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global 
compute_resid_pow with (495, 1, 32, 8) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 19148 block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 19148 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) ========= sending heartbeat 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 
19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 
894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (495, 1, 2007, 81) 218 block size 256 grid size (495, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 672, 44) 894 block size 256 grid size (495, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19148 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19148 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 224, 24) 3604 block size 256 grid size (495, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19148 block size 256 grid size (500, 2, 1) global compute_resid_pow with (495, 1, 80, 12) 14456 block size 256 grid size (495, 5, 1) global compute_resid_pow with (495, 1, 32, 8) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 16, 4) 19148 
block size 256 grid size (495, 1, 1) global compute_resid_pow with (495, 1, 8, 4) 19148 block size 128 grid size (495, 8, 1) global compute_resid_pow with (495, 1, 19, 21) 19148 block size 256 grid size (495, 2, 1) global compute_resid_pow with (495, 1, 19, 21) 19148 block size 256 grid size (495, 2, 1) FSC No-Mask... 0.143 at 97.903 radwn. 0.5 at 79.745 radwn. Took 6.013s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.814 radwn. 0.5 at 93.177 radwn. Took 5.526s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 105.495 radwn. 0.5 at 97.533 radwn. Took 17.425s. FSC Tight Mask... ========= sending heartbeat 0.143 at 110.526 radwn. 0.5 at 100.189 radwn. Took 13.045s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 110.748 radwn. 0.5 at 100.015 radwn. Took 25.338s. ---- Computing FSC with mask 2.00 to 6.00 FSC No-Mask... 0.143 at 97.903 radwn. 0.5 at 79.745 radwn. Took 1.935s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.814 radwn. 0.5 at 93.177 radwn. Took 2.761s. FSC Loose Mask... ========= sending heartbeat 0.143 at 105.495 radwn. 0.5 at 97.533 radwn. Took 10.527s. FSC Tight Mask... ========= sending heartbeat 0.143 at 115.968 radwn. 0.5 at 102.474 radwn. Took 10.511s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 110.736 radwn. 0.5 at 100.312 radwn. Took 24.386s. ---- Computing FSC with mask 2.25 to 7.00 FSC No-Mask... 0.143 at 97.903 radwn. 0.5 at 79.745 radwn. Took 2.084s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.814 radwn. 0.5 at 93.177 radwn. Took 2.889s. FSC Loose Mask... ========= sending heartbeat 0.143 at 105.495 radwn. 0.5 at 97.533 radwn. Took 10.027s. FSC Tight Mask... ========= sending heartbeat 0.143 at 113.376 radwn. 0.5 at 101.903 radwn. Took 10.747s. FSC Noise Sub... 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 111.253 radwn. 0.5 at 100.655 radwn. Took 26.627s. ---- Computing FSC with mask 2.50 to 8.00 FSC No-Mask... 0.143 at 97.903 radwn. 0.5 at 79.745 radwn. Took 2.116s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.814 radwn. 0.5 at 93.177 radwn. Took 3.012s. FSC Loose Mask... ========= sending heartbeat 0.143 at 105.495 radwn. 0.5 at 97.533 radwn. Took 9.989s. FSC Tight Mask... ========= sending heartbeat 0.143 at 112.488 radwn. 0.5 at 101.500 radwn. Took 10.366s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 111.350 radwn. 0.5 at 100.734 radwn. Took 22.681s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat *************************************************************** /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in 
sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. 
self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: divide by zero encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) ========= main process now complete. ========= monitor process now complete.