================= CRYOSPARCW ======= 2021-01-29 17:22:59.999048 ========= Project P17 Job J403 Master jptitan Port 39002 =========================================================================== ========= monitor process now starting main process MAINPROCESS PID 98000 ========= monitor process now waiting for main process MAIN PID 98000 refine.newrun cryosparc_compute.jobs.jobregister ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat *************************************************************** Running job J403 of type nonuniform_refine_new Running job on hostname %s jptitan Allocated Resources : {'fixed': {'SSD': True}, 'hostname': 'jptitan', 'lane': 'default', 'lane_type': 'default', 'license': True, 'licenses_acquired': 1, 'slots': {'CPU': [0, 1, 2, 3], 'GPU': [0], 'RAM': [0, 1, 2]}, 'target': {'cache_path': '/scratch', 'cache_quota_mb': None, 'cache_reserve_mb': 10000, 'desc': None, 'gpus': [{'id': 0, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 1, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 2, 'mem': 11554324480, 'name': 'GeForce RTX 2080 Ti'}], 'hostname': 'jptitan', 'lane': 'default', 'monitor_port': None, 'name': 'jptitan', 'resource_fixed': {'SSD': True}, 'resource_slots': {'CPU': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63], 'GPU': [0, 1, 2], 'RAM': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]}, 'ssh_str': 'jparmache@jptitan', 'title': 'Worker node jptitan', 'type': 'node', 'worker_bin_path': '/data/software/cryosparc/cryosparc2_worker/bin/cryosparcw'}} ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256========= sending heartbeat grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (318, 1, 2007, 81) 218 block size 256 grid size (318, 126, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (318, 1, 672, 44) 894 block size 256 grid size (318, 42, 1) global compute_resid_pow with (318, 1, 224, 24) 3604 block size 256 grid size (318, 14, 1) global compute_resid_pow with (318, 1, 80, 12) 4210 block size 256 grid size (318, 5, 1) global compute_resid_pow with (318, 1, 32, 8) 4210 block size 256 grid size (318, 2, 1) global compute_resid_pow with (318, 1, 16, 4) 4210 block size 256 grid size (318, 1, 1) global compute_resid_pow with (318, 1, 8, 4) 4210 block size 128 grid size (318, 8, 1) global compute_resid_pow with (318, 1, 19, 21) 4210 block size 256 grid size (318, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat ========= sending heartbeat 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat ========= sending heartbeat (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (319, 1, 2007, 81) 218 block size 256 grid size (319, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (319, 1, 672, 44) 894 block size 256 grid size (319, 42, 1) global compute_resid_pow with (319, 1, 224, 24) 3604 block size 256 grid size (319, 14, 1) global compute_resid_pow with (319, 1, 80, 12) 4210 block size 256 grid size (319, 5, 1) global compute_resid_pow with (319, 1, 32, 8) 4210 block size 256 grid size (319, 2, 1) global compute_resid_pow with (319, 1, 16, 4) 4210 block size 256 grid size (319, 1, 1) global compute_resid_pow with (319, 1, 8, 4) 4210 block size 128 grid size (319, 8, 1) global compute_resid_pow with (319, 1, 19, 21) 4210 block size 256 grid size (319, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size========= sending heartbeat (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (318, 1, 2007, 81) 218 block size 256 grid size (318, 126, 1) global compute_resid_pow with (318, 1, 672, 44) 894 block size 256 grid size (318, 42, 1) global compute_resid_pow with (318, 1, 224, 24) 3604 block size 256 grid size (318, 14, 1) global compute_resid_pow with (318, 1, 80, 12) 4210 block size 256 grid size (318, 5, 1) global compute_resid_pow with (318, 1, 32, 8) 4210 block size 256 grid size (318, 2, 1) global compute_resid_pow with (318, 1, 16, 4) 4210 block size 256 grid size (318, 1, 1) global compute_resid_pow with (318, 1, 8, 4) 4210 block size 128 grid size (318, 8, 1) global compute_resid_pow with (318, 1, 19, 21) 4210 block size 256 grid size (318, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with ========= sending heartbeat (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (319, 1, 2007, 81) 218 block size 256 grid size (319, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 4210 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 4210 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 4210 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 4210 block size 256 grid size (500, 2, 1) global compute_resid_pow with (319, 1, 672, 44) 894 block size 256 grid size (319, 42, 1) global compute_resid_pow with (319, 1, 224, 24) 3604 block size 256 grid size (319, 14, 1) global compute_resid_pow with (319, 1, 80, 12) 4210 block size 256 grid size (319, 5, 1) global compute_resid_pow with (319, 1, 32, 8) 4210 block size 256 grid size (319, 2, 1) global compute_resid_pow with (319, 1, 16, 4) 4210 block size 256 grid size (319, 1, 1) global compute_resid_pow with (319, 1, 8, 4) 4210 block size 128 grid size (319, 8, 1) global compute_resid_pow with (319, 1, 19, 21) 4210 block size 256 grid size (319, 2, 1) FSC No-Mask... 0.143 at 73.366 radwn. 0.5 at 50.176 radwn. Took 2.233s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 81.000 radwn. 0.5 at 52.992 radwn. Took 2.818s. FSC Loose Mask... ========= sending heartbeat 0.143 at 94.124 radwn. 0.5 at 76.186 radwn. Took 11.430s. FSC Tight Mask... ========= sending heartbeat 0.143 at 98.240 radwn. 0.5 at 83.721 radwn. Took 10.304s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (344, 1, 2007, 81) 218 block size 256 grid size (344, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (344, 1, 672, 44) 894 block size 256 grid size (344, 42, 1) global compute_resid_pow with (344, 1, 224, 24) 3604 block size 256 grid size (344, 14, 1) global compute_resid_pow with (344, 1, 80, 12) 14456 block size 256 grid size (344, 5, 1) global compute_resid_pow with (344, 1, 32, 8) 15162 block size 256 grid size (344, 2, 1) global compute_resid_pow with (344, 1, 16, 4) 15162 block size 256 grid size (344, 1, 1) global compute_resid_pow with (344, 1, 8, 4) 15162 block size 128 grid size (344, 8, 1) global compute_resid_pow with (344, 1, 19, 21) 15162 block size 256 grid size (344, 2, 1) global compute_resid_pow with (344, 1, 19, 21) 15162 block size 256 grid size (344, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (345, 1, 2007, 81) 218 block size 256 grid size (345, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (345, 1, 672, 44) 894 block size 256 grid size (345, 42, 1) global compute_resid_pow with (345, 1, 224, 24) 3604 block size 256 grid size (345, 14, 1) global compute_resid_pow with (345, 1, 80, 12) 14456 block size 256 grid size (345, 5, 1) global compute_resid_pow with (345, 1, 32, 8) 15162 block size 256 grid size (345, 2, 1) global compute_resid_pow with (345, 1, 16, 4) 15162 block size 256 grid size (345, 1, 1) global compute_resid_pow with (345, 1, 8, 4) 15162 block size 128 grid size (345, 8, 1) global compute_resid_pow with (345, 1, 19, 21) 15162 block size 256 grid size (345, 2, 1) global compute_resid_pow with (345, 1, 19, 21) 15162 block size 256 grid size (345, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (344, 1, 2007, 81) 218 block size 256 grid size (344, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (344, 1, 672, 44) 894 block size 256 grid size (344, 42, 1) global compute_resid_pow with (344, 1, 224, 24) 3604 block size 256 grid size (344, 14, 1) global compute_resid_pow with (344, 1, 80, 12) 14456 block size 256 grid size (344, 5, 1) global compute_resid_pow with (344, 1, 32, 8) 15162 block size 256 grid size (344, 2, 1) global compute_resid_pow with (344, 1, 16, 4) 15162 block size 256 grid size (344, 1, 1) global compute_resid_pow with (344, 1, 8, 4) 15162 block size 128 grid size (344, 8, 1) global compute_resid_pow with (344, 1, 19, 21) 15162 block size 256 grid size (344, 2, 1) global compute_resid_pow with (344, 1, 19, 21) 15162 block size 256 grid size (344, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (345, 1, 2007, 81) 218 block size 256 grid size (345, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15162 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15162 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15162 block size 256 grid size (500, 2, 1) global compute_resid_pow with (345, 1, 672, 44) 894 block size 256 grid size (345, 42, 1) global compute_resid_pow with (345, 1, 224, 24) 3604 block size 256 grid size (345, 14, 1) global compute_resid_pow with (345, 1, 80, 12) 14456 block size 256 grid size (345, 5, 1) global compute_resid_pow with (345, 1, 32, 8) 15162 block size 256 grid size (345, 2, 1) global compute_resid_pow with (345, 1, 16, 4) 15162 block size 256 grid size (345, 1, 1) global compute_resid_pow with (345, 1, 8, 4) 15162 block size 128 grid size (345, 8, 1) global compute_resid_pow with (345, 1, 19, 21) 15162 block size 256 grid size (345, 2, 1) global compute_resid_pow with (345, 1, 19, 21) 15162 block size 256 grid size (345, 2, 1) FSC No-Mask... ========= sending heartbeat 0.143 at 92.668 radwn. 0.5 at 73.008 radwn. Took 3.112s. FSC Spherical Mask... 0.143 at 96.625 radwn. 0.5 at 79.013 radwn. Took 3.212s. FSC Loose Mask... ========= sending heartbeat 0.143 at 100.816 radwn. 0.5 at 90.359 radwn. Took 11.471s. FSC Tight Mask... ========= sending heartbeat 0.143 at 103.482 radwn. 0.5 at 96.642 radwn. Took 10.515s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (155, 1, 2007, 81) 218 block size 256 grid size (155, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 672, 44) 894 block size 256 grid size (155, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 224, 24) 3604 block size 256 grid size (155, 14, 1) global compute_resid_pow with (155, 1, 80, 12) 14456 block size 256 grid size (155, 5, 1) global compute_resid_pow with (155, 1, 32, 8) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (155, 1, 16, 4) 16840 block size 256 grid size (155, 1, 1) global compute_resid_pow with (155, 1, 8, 4) 16840 block size 128 grid size (155, 8, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 2007, 81) 218 block size 256 grid size (155, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 672, 44) 894 block size 256 grid size (155, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 224, 24) 3604 block size 256 grid size (155, 14, 1) global compute_resid_pow with (155, 1, 80, 12) 14456 block size 256 grid size (155, 5, 1) global compute_resid_pow with (155, 1, 32, 8) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (155, 1, 16, 4) 16840 block size 256 grid size (155, 1, 1) global compute_resid_pow with (155, 1, 8, 4) 16840 block size 128 grid size (155, 8, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (155, 1, 2007, 81) 218 block size 256 grid size (155, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 672, 44) 894 block size 256 grid size (155, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 224, 24) 3604 block size 256 grid size (155, 14, 1) global compute_resid_pow with (155, 1, 80, 12) 14456 block size 256 grid size (155, 5, 1) global compute_resid_pow with (155, 1, 32, 8) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (155, 1, 16, 4) 16840 block size 256 grid size (155, 1, 1) global compute_resid_pow with (155, 1, 8, 4) 16840 block size 128 grid size (155, 8, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (155, 1, 2007, 81) 218 block size 256 grid size (155, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (155, 1, 672, 44) 894 block size 256 grid size (155, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (155, 1, 224, 24) 3604 block size 256 grid size (155, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (155, 1, 80, 12) 14456 block size 256 grid size (155, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 32, 8) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16840 block size 256 grid size (500, 1, 1) global compute_resid_pow with (155, 1, 16, 4) 16840 block size 256 grid size (155, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16840 block size 128 grid size (500, 8, 1) global compute_resid_pow with (155, 1, 8, 4) 16840 block size 128 grid size (155, 8, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) global compute_resid_pow with (155, 1, 19, 21) 16840 block size 256 grid size (155, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16840 block size 256 grid size (500, 2, 1) FSC No-Mask... 0.143 at 96.143 radwn. 0.5 at 74.696 radwn. Took 2.147s. FSC Spherical Mask... 0.143 at 99.034 radwn. 0.5 at 82.479 radwn. Took 2.775s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 102.301 radwn. 0.5 at 95.539 radwn. Took 13.143s. FSC Tight Mask... ========= sending heartbeat 0.143 at 106.691 radwn. 0.5 at 99.467 radwn. Took 11.163s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (263, 1, 2007, 81) 218 block size 256 grid size (263, 126, 1) global compute_resid_pow with (263, 1, 672, 44) 894 block size 256 grid size (263, 42, 1) global compute_resid_pow with (263, 1, 224, 24) 3604 block size 256 grid size (263, 14, 1) global compute_resid_pow with (263, 1, 80, 12) 14456 block size 256 grid size (263, 5, 1) global compute_resid_pow with (263, 1, 32, 8) 17878 block size 256 grid size (263, 2, 1) global compute_resid_pow with (263, 1, 16, 4) 17878 block size 256 grid size (263, 1, 1) global compute_resid_pow with (263, 1, 8, 4) 17878 block size 128 grid size (263, 8, 1) global compute_resid_pow with (263, 1, 19, 21) 17878 block size 256 grid size (263, 2, 1) global compute_resid_pow with (263, 1, 19, 21) 17878 block size 256 grid size (263, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 2007, 81) 218 block size 256 grid size (264, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 672, 44) 894 block size 256 grid size (264, 42, 1) global compute_resid_pow with (264, 1, 224, 24) 3604 block size 256 grid size (264, 14, 1) global compute_resid_pow with (264, 1, 80, 12) 14456 block size 256 grid size (264, 5, 1) global compute_resid_pow with (264, 1, 32, 8) 17878 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 16, 4) 17878 block size 256 grid size (264, 1, 1) global compute_resid_pow with (264, 1, 8, 4) 17878 block size 128 grid size (264, 8, 1) global compute_resid_pow with (264, 1, 19, 21) 17878 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 19, 21) 17878 block size 256 grid size (264, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (263, 1, 2007, 81) 218 block size 256 grid size (263, 126, 1) global compute_resid_pow with (263, 1, 672, 44) 894 block size 256 grid size (263, 42, 1) global compute_resid_pow with (263, 1, 224, 24) 3604 block size 256 grid size (263, 14, 1) global compute_resid_pow with (263, 1, 80, 12) 14456 block size 256 grid size (263, 5, 1) global compute_resid_pow with (263, 1, 32, 8) 17878 block size 256 grid size (263, 2, 1) global compute_resid_pow with (263, 1, 16, 4) 17878 block size 256 grid size (263, 1, 1) global compute_resid_pow with (263, 1, 8, 4) 17878 block size 128 grid size (263, 8, 1) global compute_resid_pow with (263, 1, 19, 21) 17878 block size 256 grid size (263, 2, 1) global compute_resid_pow with (263, 1, 19, 21) 17878 block size 256 grid size (263, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256========= sending heartbeat grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (264, 1, 2007, 81) 218 block size 256 grid size (264, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 672, 44) 894 block size 256 grid size (264, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 17878 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 17878 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 224, 24) 3604 block size 256 grid size (264, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 17878 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 80, 12) 14456 block size 256 grid size (264, 5, 1) global compute_resid_pow with (264, 1, 32, 8) 17878 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 16, 4) 17878 block size 256 grid size (264, 1, 1) global compute_resid_pow with (264, 1, 8, 4) 17878 block size 128 grid size (264, 8, 1) global compute_resid_pow with (264, 1, 19, 21) 17878 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 19, 21) 17878 block size 256 grid size (264, 2, 1) FSC No-Mask... 0.143 at 97.010 radwn. 0.5 at 75.859 radwn. Took 2.447s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.927 radwn. 0.5 at 87.908 radwn. Took 2.891s. FSC Loose Mask... ========= sending heartbeat 0.143 at 103.426 radwn. 0.5 at 96.835 radwn. Took 12.680s. FSC Tight Mask... ========= sending heartbeat 0.143 at 107.584 radwn. 0.5 at 100.525 radwn. Took 10.935s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 32, 8) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 18190 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 18190 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (109, 1, 2007, 81) 218 block size 256 grid size (109, 126, 1) global compute_resid_pow with (109, 1, 672, 44) 894 block size 256 grid size (109, 42, 1) global compute_resid_pow with (109, 1, 224, 24) 3604 block size 256 grid size (109, 14, 1) global compute_resid_pow with (109, 1, 80, 12) 14456 block size 256 grid size (109, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (109, 1, 32, 8) 18190 block size 256 grid size (109, 2, 1) global compute_resid_pow with (109, 1, 16, 4) 18190 block size 256 grid size (109, 1, 1) global compute_resid_pow with (109, 1, 8, 4) 18190 block size 128 grid size (109, 8, 1) global compute_resid_pow with (109, 1, 19, 21) 18190 block size 256 grid size (109, 2, 1) global compute_resid_pow with (109, 1, 19, 21) 18190 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (108, 1, 32, 8) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 16, 4) 18190 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 18190 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256========= sending heartbeat grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (108, 1, 32, 8) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 18190 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 18190 block size 128 grid size (108, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 18190 block size 256 grid size (500, 1, 1) global compute_resid_pow with (108, 1, 19, 21) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 18190 block size 128 grid size (500, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 18190 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18190 block size 256 grid size (500, 2, 1) FSC No-Mask... 0.143 at 101.462 radwn. 0.5 at 94.068 radwn. Took 2.258s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 103.694 radwn. 0.5 at 97.269 radwn. Took 2.847s. FSC Loose Mask... ========= sending heartbeat 0.143 at 108.951 radwn. 0.5 at 101.087 radwn. Took 11.648s. FSC Tight Mask... ========= sending heartbeat 0.143 at 117.157 radwn. 0.5 at 104.456 radwn. Took 10.285s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 116.818 radwn. 0.5 at 104.165 radwn. Took 22.920s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (108, 1, 32, 8) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 16, 4) 21436 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21436 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (109, 1, 2007, 81) 218 block size 256 grid size (109, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (109, 1, 672, 44) 894 block size 256 grid size (109, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (109, 1, 224, 24) 3604 block size 256 grid size (109, 14, 1) global compute_resid_pow with (109, 1, 80, 12) 14456 block size 256 grid size (109, 5, 1) global compute_resid_pow with (109, 1, 32, 8) 21436 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (109, 1, 16, 4) 21436 block size 256 grid size (109, 1, 1) global compute_resid_pow with (109, 1, 8, 4) 21436 block size 128 grid size (109, 8, 1) global compute_resid_pow with (109, 1, 19, 21) 21436 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (109, 1, 19, 21) 21436 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 32, 8) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21436 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21436 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256========= sending heartbeat grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21436 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21436 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21436 block size 256 grid size (500, 2, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (108, 1, 32, 8) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21436 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21436 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21436 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21436 block size 256 grid size (108, 2, 1) FSC No-Mask... 0.143 at 101.818 radwn. 0.5 at 94.538 radwn. Took 2.714s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 104.365 radwn. 0.5 at 97.578 radwn. Took 3.074s. FSC Loose Mask... ========= sending heartbeat 0.143 at 110.797 radwn. 0.5 at 101.396 radwn. Took 11.292s. FSC Tight Mask... ========= sending heartbeat 0.143 at 117.734 radwn. 0.5 at 104.839 radwn. Took 10.382s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 117.312 radwn. 0.5 at 104.571 radwn. Took 22.986s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 32, 8) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21614 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21614 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (109, 1, 2007, 81) 218 block size 256 grid size (109, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (109, 1, 672, 44) 894 block size 256 grid size (109, 42, 1) global compute_resid_pow with (109, 1, 224, 24) 3604 block size 256 grid size (109, 14, 1) global compute_resid_pow with (109, 1, 80, 12) 14456 block size 256 grid size (109, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (109, 1, 32, 8) 21614 block size 256 grid size (109, 2, 1) global compute_resid_pow with (109, 1, 16, 4) 21614 block size 256 grid size (109, 1, 1) global compute_resid_pow with (109, 1, 8, 4) 21614 block size 128 grid size (109, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (109, 1, 19, 21) 21614 block size 256 grid size (109, 2, 1) global compute_resid_pow with (109, 1, 19, 21) 21614 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 32, 8) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21614 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21614 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (108, 1, 32, 8) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21614 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21614 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (108, 1, 19, 21) 21614 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21614 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21614 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21614 block size 256 grid size (500, 2, 1) FSC No-Mask... 0.143 at 101.875 radwn. 0.5 at 94.634 radwn. Took 2.561s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 104.594 radwn. 0.5 at 97.666 radwn. Took 3.078s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.160 radwn. 0.5 at 101.407 radwn. Took 14.097s. FSC Tight Mask... ========= sending heartbeat 0.143 at 117.856 radwn. 0.5 at 104.782 radwn. Took 11.270s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 117.645 radwn. 0.5 at 104.462 radwn. Took 24.395s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size========= sending heartbeat (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (108, 1, 32, 8) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21744 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21744 block size 128 grid size (108, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (108, 1, 19, 21) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (109, 1, 2007, 81) 218 block size 256 grid size (109, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (109, 1, 672, 44) 894 block size 256 grid size (109, 42, 1) global compute_resid_pow with (109, 1, 224, 24) 3604 block size 256 grid size (109, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (109, 1, 80, 12) 14456 block size 256 grid size (109, 5, 1) global compute_resid_pow with (109, 1, 32, 8) 21744 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (109, 1, 16, 4) 21744 block size 256 grid size (109, 1, 1) global compute_resid_pow with (109, 1, 8, 4) 21744 block size 128 grid size (109, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (109, 1, 19, 21) 21744 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (109, 1, 19, 21) 21744 block size 256 grid size (109, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (108, 1, 32, 8) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21744 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21744 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (108, 1, 2007, 81) 218 block size 256 grid size (108, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21744 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21744 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (108, 1, 672, 44) 894 block size 256 grid size (108, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21744 block size 256 grid size (500, 2, 1) global compute_resid_pow with (108, 1, 224, 24) 3604 block size 256 grid size (108, 14, 1) global compute_resid_pow with (108, 1, 80, 12) 14456 block size 256 grid size (108, 5, 1) global compute_resid_pow with (108, 1, 32, 8) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 16, 4) 21744 block size 256 grid size (108, 1, 1) global compute_resid_pow with (108, 1, 8, 4) 21744 block size 128 grid size (108, 8, 1) global compute_resid_pow with (108, 1, 19, 21) 21744 block size 256 grid size (108, 2, 1) global compute_resid_pow with (108, 1, 19, 21) 21744 block size 256 grid size (108, 2, 1) FSC No-Mask... 0.143 at 101.875 radwn. 0.5 at 94.717 radwn. Took 2.632s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 104.703 radwn. 0.5 at 97.700 radwn. Took 2.937s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.103 radwn. 0.5 at 101.385 radwn. Took 11.340s. FSC Tight Mask... ========= sending heartbeat 0.143 at 118.016 radwn. 0.5 at 104.728 radwn. Took 10.219s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 117.984 radwn. 0.5 at 104.290 radwn. Took 21.429s. ---- Computing FSC with mask 2.00 to 6.00 FSC No-Mask... 0.143 at 101.875 radwn. 0.5 at 94.717 radwn. Took 1.929s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 104.703 radwn. 0.5 at 97.700 radwn. Took 2.710s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.103 radwn. 0.5 at 101.385 radwn. Took 9.697s. FSC Tight Mask... ========= sending heartbeat 0.143 at 123.448 radwn. 0.5 at 109.173 radwn. Took 9.711s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 117.641 radwn. 0.5 at 103.137 radwn. Took 20.116s. ---- Computing FSC with mask 2.25 to 7.00 FSC No-Mask... 0.143 at 101.875 radwn. 0.5 at 94.717 radwn. Took 1.921s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 104.703 radwn. 0.5 at 97.700 radwn. Took 2.714s. FSC Loose Mask... 0.143 at 111.103 radwn. 0.5 at 101.385 radwn. Took 9.781s. FSC Tight Mask... ========= sending heartbeat 0.143 at 121.814 radwn. 0.5 at 108.040 radwn. Took 9.655s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 118.359 radwn. 0.5 at 104.289 radwn. Took 21.414s. ---- Computing FSC with mask 2.50 to 8.00 FSC No-Mask... 0.143 at 101.875 radwn. 0.5 at 94.717 radwn. Took 1.920s. FSC Spherical Mask... 0.143 at 104.703 radwn. 0.5 at 97.700 radwn. Took 2.731s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.103 radwn. 0.5 at 101.385 radwn. Took 9.707s. FSC Tight Mask... ========= sending heartbeat 0.143 at 120.399 radwn. 0.5 at 107.243 radwn. Took 9.964s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 118.557 radwn. 0.5 at 104.816 radwn. Took 21.501s. ---- Computing FSC with mask 2.75 to 9.00 FSC No-Mask... ========= sending heartbeat 0.143 at 101.875 radwn. 0.5 at 94.717 radwn. Took 1.919s. FSC Spherical Mask... 0.143 at 104.703 radwn. 0.5 at 97.700 radwn. Took 2.750s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.103 radwn. 0.5 at 101.385 radwn. Took 9.815s. FSC Tight Mask... ========= sending heartbeat 0.143 at 119.498 radwn. 0.5 at 106.632 radwn. Took 10.247s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 118.568 radwn. 0.5 at 105.003 radwn. Took 20.510s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat *************************************************************** /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/site-packages/skcuda/cublas.py:284: UserWarning: creating CUBLAS context to get version number warnings.warn('creating CUBLAS context to get version number') /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: divide by zero encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: divide by zero encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) ========= main process now complete. ========= monitor process now complete.